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Abstract 


Many classical problems of computer science — such as paging, scheduling, and maintaining 
dynamic data structures — are naturally on-line; an algorithm for such a problem is constantly 
making irrevocable decisions without knowing what its future input will be. The competitive 
analysis of on-line algorithms was brought into prominence by the work of Sleator and Tarjan 
in 1985 as a theoretical framework in which to measure the performance of such algorithms. 
Since then, a variety of on-line problems have been studied from this perspective. 

We consider two such problems in this setting: robot navigation, and computationally 
efficient algorithms for the k-server problem of Manasse, McGeoch, and Sleator. For the first 
of these, we give algorithms for a robot searching for a goal in an unknown simple polygon; our 
approach can be viewed as an extension of some of the strategies of Baeza-Yates, Culberson, 
and Rawlins to a more general class of geometric search spaces. We also provide the first 
competitive analysis of the robot localization problem — a fundamental task for an autonomous 
mobile robot, in which it must determine its location in a known environment. 

Finally, we address the general question of how the performance of an on-line algorithm 
varies with the computational resources it has at its disposal. Within the context of the k-server 
problem, a natural specialization of this question is the following: can an algorithm which uses 
only constant space, and constant time in processing each request, match the performance of 
algorithms which are allowed to perform an arbitrary amount of computation at each step? We 
show a non-trivial lower bound for the class of 2-server balancing algorithms, a subclass of the 
constant-time algorithms; this represents one of the first lower bounds for an on-line algorithm 
based solely on its computational resources. 
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Chapter 1 


Introduction 


The distinction between on-line and off-line algorithms is an old one in computer science, yet 
the current surge of on-line algorithms research in the theory community is relatively new. In 
large part this is due to the surprisingly rich structure of the competitive ratio, brought into 
focus by Sleator and Tarjan [ST] in 1985 as a means of analyzing on-line algorithms. 

At a general level, an on-line algorithm is one which receives its input in a piecemeal fashion; 
at each step, it must make irrevocable decisions before seeing the remainder of the input. An off- 
line algorithm, on the other hand, has the luxury of reading the entire input before performing 
any computation. The competitive ratio of an on-line algorithm is then defined to be the worst- 
case ratio of the cost it incurs on a given problem instance to the cost of the optimum (off-line) 
solution. (Readers not familiar with the definitions in this area should read Chapter 2 before 
this Introduction.) 

In this thesis, we consider two naturally on-line problems within the framework of compet- 
itive analysis: robot navigation, and the k-server problem. 

In a typical on-line navigation problem, a robot with vision is placed in a geometric envi- 
ronment for which it does not have a map; it must perform some task in this environment, such 
as constructing a map, or finding a recognizable goal hidden in the environment. Baeza- Yates, 
Culberson, and Rawlins [BCR], in one of the first papers in this area, considered the latter type 
of search problem. They gave algorithms for searching in a number of structured situations, 
including the problem of searching for a point on a line, a line in the plane, and a point in a 


erid graph. The basis for these algorithms is a collection of techniques that have found wide 


application in subsequent papers. 

In Chapter 3, we consider more geometric versions of this search problem — specifically the 
problem of a robot searching for a goal in an unknown rectilinear polygon. This differs from 
the model of [BCR] in that the structure of the space to be searched is not known to the robot; 
the generality of rectilinear polygons also makes possible a much wider class of search spaces. 
The search algorithm we give is fairly natural, performing exploration on larger and larger 
regions of the polygon. This problem of exploration in a simple rectilinear polygon has been 
considered separately by Deng, Kameda, and Papadimitriou [DKP], who gave an algorithm 
with a competitive ratio of 2; we offer a simple randomized variant with competitive ratio 5/4. 

It is also interesting to consider special cases of the search problem in a polygon — re- 
stricting the type of polygon so that better competitive ratios can be obtained. This was the 
approach taken by Klein [kK], who considered the class of streets and gave a search algorithm 
with competitive ratio at most 1+ 3 (~ 5.71). We give an algorithm for streets with compet- 
itive ratio at most \/4 + V8 (~ 2.61), improving on this bound by more than a factor of two. 
A number of other types of polygons may well be amenable to the same type of approach, and 
this appears to be an interesting direction for further work. 

Not all on-line navigation problems assume a complete lack of information about the envi- 
ronment, however. For example, a fundamental task for an autonomous mobile robot is that 
of localization — determining its location in a known environment. This problem arises in set- 
tings that range from the computer analysis of aerial photographs to the design of autonomous 
Mars rovers. Typically, localization occurs in two phases: first the robot determines all possible 
locations consistent with its current local view of the environment; then it moves around, per- 
forming enough “reconnaissance” to determine uniquely where it is. Despite the attention that 
localization has received in the robotics literature, the first theoretical work on it appeared only 
recently in a paper of Guibas, Motwani, and Raghavan [GMR]; they gave geometric algorithms 
for the first part of the problem — enumerating locations consistent with a view — but did not 
address the issue of strategies for the second part. 

In Chapter 4, we consider this aspect of localization, providing algorithms for a mobile robot 
to move around in its environment so as to determine efficiently where it is. We argue that 


localization is a natural setting in which to apply the competitive analysis of on-line algorithms; 
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the distance traveled by the robot is compared to the length of the shortest possible localizing 
tour. The search strategies of [BCR] allow one to obtain a fairly straightforward, and non- 
optimal, localization algorithm; our main result here is an algorithm which makes stronger use 
of the robot’s knowledge of the map in order to improve asymptotically on this approach. An 
interesting feature of our technique is the way in which the robot is able to identify “critical 
directions” in the environment which allow it to perform late stages of the search more efficiently. 

Finally, in Chapter 5, we consider some problems related to the computational resources 
required by an on-line algorithm. The overwhelming majority of research in competitive analysis 
has proceeded along information-theoretic lines; that is, the only factor limiting the performance 
of an on-line algorithm is its lack of information about the nature of future input. But for any 
given problem, a natural question to ask is whether the best competitive ratio attainable by an 
on-line algorithm depends on the computational resources it has at its disposal. We consider 
such questions within the context of the k-server problem — an elegant generalization of paging 
and certain types of scheduling problems, and one of the most well-studied problems in the area 
of on-line algorithms. Here, an algorithm is presented with a metric space M and k mobile 
servers that can move within M. Points of MM request service, one after another, and the 
algorithm must move a server to the location of each request before seeing the next one. The 
goal is to minimize the total distance traveled by the servers. (See Chapter 2 for background 
on the k-server problem.) 

We say that a real-time server algorithm is one which uses only a constant amount of space, 
and constant time per request. This notion has been studied especially with respect to the 2- 
server problem, by Irani and Rubinfeld [IR] and Chrobak and Larmore [CL2]. In this setting it 
has been observed that, while there are computationally expensive 2-server algorithms achieving 
the optimal competitive ratio of 2 [MMS, CL1, CL4], the best known competitive ratio for a 
real-time algorithm is 4 [CL2]. 

In Chapter 5, we give a real-time 2-server algorithm which is 2-competitive in n-dimensional 
space under the £, (“Manhattan”) metric. This considerably extends the class of metric spaces 
known to have optimal real-time algorithms; the technique we use can be seen as a higher- 
dimensional generalization of the elegant “Double-Coverage” algorithm discovered for k servers 


on a line by Chrobak, Karloff, Payne, and Vishwanathan [CKPV]. We also prove a lower bound 
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for 2-server balancing algorithms, a sub-class of the real-time algorithms. Balancing algorithms 
have been proposed for a number of special cases of the k-server problem; they are so named 
because they seek to “balance” the distance traveled evenly among the servers. We show that 
no such algorithm can achieve a competitive ratio better than (5 + V7)/2 (~ 3.82) for the 
2-server problem; this shows that no optimal on-line 2-server algorithm can be expressed as a 
balancing algorithm. This also represents, to our knowledge, one of the first lower bounds for 


an on-line algorithm based solely on its computational resources. 


Some Notation 


We use standard order-of-growth notation (O, 2, 0) throughout; additionally we say that f(n) 


is o(g(n)) if 
f(r) 


lim —~=0 
n—00 g(n) 


and f(n) is w(g(n)) if g is o( f). 
Two different metrics in R@ will be used at various places: L, and Ly. In the L, (“Man- 
hattan”) metric, the distance between points x = (2',...,27) and y = (y',...,y") is given 


by ; 
L,(2,y) = S- |x" _ y'|. 
i=1 


The Ly metric is the standard Euclidean distance; that is, 


a 1/2 
Lo(@,y) = (de - r) 


i=1 
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Chapter 2 


On-Line Algorithms: An Overview 


The study of on-line algorithms stems from the issue of how an algorithm receives its input. In 
the traditional view, all input is present immediately: for example, if the algorithm is supposed 
to be computing the shortest path between two nodes in a graph, then one assumes that it has 
access to entire graph from the start. This is clearly a reasonable assumption in a large number 
of situations; however, there are equally natural situations in which it makes sense to imagine 
that the algorithm is receiving its input in an on-line fashion: the input arrives in a sequence 
of small pieces, and with the arrival of each piece, the algorithm must make certain immediate 
and irrevocable decisions. 

In this sense, there is nothing new about on-line algorithms. Certainly, the difference be- 
tween “one-pass” and “two-pass” algorithms has always been a fundamental issue in designing 
computer programs; and this is essentially the same distinction expressed in different termi- 
nology. In other cases, the circumstances of the problem force us to work within the on-line 
framework — for example, paging algorithms evict pages from memory, call-control algorithms 
route connections through a network, and trading strategies tell investors to buy and sell, all of 
them without knowing what the future sequence of “input” will look like. For such problems, 
there are only on-line algorithms. 

At the same time, however, there has been a tremendous increase in the number of “on- 
line algorithms” papers over the last ten years; this has derived mainly from the increasing 
popularity of competitive analysis as a perspective from which to discuss the performance of 


such algorithms. In the following section, we discuss competitive analysis in general, and 


13 


specifically the paper of Sleator and Tarjan [ST]. 


2.1 Competitive Analysis 


Typically, an on-line algorithm is trying to solve some optimization problem (as in the examples 
cited above). Thus, if A is an on-line algorithm, and o is an input sequence, then the crucial 
parameter which must be minimized in most on-line problems is A(o), the cost incurred by A 
ono. 

For any given problem, we can consider the optimal off-line algorithm OPT, which sees the 
entire input ahead of time and thus always achieves the minimum possible cost on o. In the 
traditional analysis of algorithms, the fundamental question is how computationally “expensive” 
(in terms of time or space complexity) it is to design the algorithm OPT. In the competitive 
analysis of on-line algorithms, one instead takes OPT as a given and asks how close an on-line 
algorithm A can come to achieving the performance of OPT. 

Thus we say that an on-line algorithm A is c-competitive if for some absolute constant /, 
the expression 


A(o)—¢-OPT(c) 


is bounded by @ for all input sequences o. We will sometimes say that A is “competitive” if it 
is c-competitive for some constant c; the infimum of the set of ¢ for which A is c competitive is 
its competitive ratio. 

From a historical point of view, it is quite difficult to identify the first use of this performance 
measure as applied to on-line algorithms. It appears in the bin-packing literature of the 1970’s, 
which considered the well-known problem of determining the fewest number of “bins,” each 
of capacity 1, which are required to contain a set of n objects with weights w ,...,w, < 1. 
This problem is NP-Complete, but a number of very natural heuristics, several of them on-line 
algorithms, give constant-factor approximations to the minimum number of bins required. 

In this setting, an on-line algorithm is simply one which decides where to place the i 
object before seeing the values of w;41,..., Wn. One such example is First Fit, which puts the 
object in the first bin in which it will fit, starting a new bin only if necessary. Garey, Graham, 


and Ullman [GGU] proved that the competitive ratio of First Fit is Z, and an almost exact 
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characterization of the performance of First Fit, as well as a number of other approximation 
algorithms, was given by Johnson in his Ph.D. thesis [J]. In a result very much in the spirit 
of current work in on-line algorithms, Yao proved in 1980 that there is no on-line bin-packing 
algorithm with a performance ratio better than 3 [Yao]. 

The competitive ratio also appears sporadically (and again not cast in the current terminol- 
ogy) in work on graph-coloring. The First Fit algorithm has a natural meaning in this world 
as well: proceed through the vertices one at a time, coloring each with the lowest-numbered 
color possible. While First Fit performs abysmally on some classes of graphs, it has long been 
observed to have relatively good behavior on others. The survey paper of Kierstead and Trotter 
[KT] mentions a number of problems related to the First Fit coloring algorithm; for example, 
Woodall asked in 1974 whether the number of colors used by First Fit was within a constant 
factor of x(G) when G is an interval graph (see also [Kier]). In the new language, this is simply 
the question of whether First Fit achieves a constant competitive ratio for this type of graph. 
In the last five years, there have been a number of papers on “on-line graph coloring” in the 
competitive framework; these will be discussed briefly in Section 2.3. 

But it was the work of Sleator and Tarjan in 1985 [ST] which launched the current deluge 
of papers. [ST] considered two classical problems of computer science, within the setting of 
competitive analysis: the maintenance of a dynamic data structure, and paging. 

On the first of these questions, [ST] dealt specifically with the problem of maintaining a 
linked list under a sequence of insert, delete, and member? requests. This problem had a long 
history, and many heuristics had been proposed. In the context of on-line algorithms, as in the 
actual applications, one is trying to minimize the total number of memory references over the 
course of a request sequence (i.e. one always wants the current item referenced to be as close 
to the front of the list as possible). What Sleator and Tarjan showed was that the well-known 
heuristic MOVE-TO-FRONT (always put the item just accessed at the front of the list) is 2- 
competitive. One thing that makes this result particularly striking is that the competitive ratio 
of such a simple rule should be so low, especially when there is in fact no good characterization 
known for the optimal off-line algorithm. The proof was also interesting for its use of a potential 
function argument, a technique that was to become extremely common in subsequent research 


in on-line algorithms (we will give an example of such an argument in Section 2.2). 
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The results in [ST] concerned with paging gave much stronger lower bounds. Here one 
pictures a computer with & pages of fast memory and an unlimited number of additional pages 
of slow memory (e.g. a disk). When a reference is made to a page not in fast memory, a “page 
fault” occurs: the referenced page must be moved in, and some page currently sitting in fast 
memory must be swapped out. The goal is to minimize the number of page faults, over the 
course of a string o of memory references. In [ST] it was proved that no on-line paging algorithm 
with & pages can be better than k-competitive, and a number of well-known algorithms such 
as LRU (evict the least-recently used page) and FIFO (evict the page that was brought in 
longest ago) match this bound. Of course, from the point of view of operating system design, 
a performance guarantee of k is ridiculously large — a rule such as LRU actually performs 
extremely well in practice — but the result does lend some insight into why looking at previous 
requests should help in designing a paging algorithm. Various extensions to the model of [ST] 
have been proposed [BIRS, KPR], but the question of why on-line paging works so well in real 
life remains an intriguing one. 

Following [ST], the paper [KMRS] analyzed additional on-line strategies for cache manage- 
ment (which is essentially the same as paging in this model), and [BLS] proposed “metrical task 
systems” as an abstract model for studying on-line algorithms. The following year, Manasse, 
McGeoch, and Sleator [MMS] introduced what has become perhaps the most well-known and 


well-studied on-line problem: the k-server problem. 


2.2. The k-Server Problem 


We imagine the following situation. An on-line algorithm A controls k mobile servers which are 
free to move around in some metric space M. A finite request sequence o is now presented to 
the algorithm, one request at a time. Each request is a point in the space M; the algorithm must 
move one of the servers to this point before seeing the next request. The goal is to minimize the 
total distance traveled by the servers, over the entire request sequence. The k-server problem 
generalizes paging in the following sense: if MW is the set of possible pages of memory, with 
the uniform metric (all non-trivial distances are 1), and the servers are the k “slots” of fast 
memory, then the problem becomes that of minimizing the number of page faults. 


In their original paper, Manasse, McGeoch, and Sleator showed that for every metric space 
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M and every on-line algorithm A, there are arbitrarily long request sequences on which 


A(o) > k- OPT(c) — O(1). 


That is, no on-line algorithm can be better than k-competitive in any metric space. Note that 
LRU or FIFO provides a matching upper bound for the uniform metric space; [MMS] also 
provided a 2-competitive 2-server algorithm and a k-competitive k-server algorithm for any 
metric space with only & + 1 points. Based on these special cases, they advanced the k-server 
conjecture: for every metric space M, there is a k-competitive k-server algorithm. At the time 
of this writing, the conjecture is still open; however, significant progress has recently been 
achieved in [KP]. 

One of the tantalizing aspects of this problem is that most natural algorithms — for example, 
the greedy algorithm (move the closest server) as well as most straightforward generalizations 
of LRU — do not achieve any bounded competitive ratio in general. Indeed, it was not known 
initially whether one could even achieve a constant competitive ratio (depending only on k) 
in an arbitrary metric space, even for the case of three servers. This was settled by Fiat, 
Rabani, and Ravid [FRR], who gave a general algorithm with a competitive ratio of at most 
20#les®) Grove improved this bound to 20“) [Gr] using a simple randomized algorithm, and 
derandomization techniques of Ben-David et. al. [BBKTW]. And very recently, Koutsoupias 
and Papadimitriou [KP] have shown that the “work-function algorithm” proposed by Chrobak 
and Larmore [CL4], as by well as other researchers, is at most (2k — 1)-competitive. 

In a separate direction, a number of papers extended the set of metric spaces for which good 
server algorithms were known. Chrobak, Karloff, Payne, and Vishwanathan [CKPV] gave an 
elegant k-competitive algorithm for & servers on the line (i.e. M = R'). The algorithm works 
as follows: let [a,6] denote the interval on the line whose left endpoint is the leftmost server 
and whose right endpoint is the rightmost server. If the request falls outside this interval, move 
only the closest server; otherwise, move the two “neighboring” servers at the same speed toward 
the request until the closer one reaches it. Subsequently, Chrobak and Larmore showed that a 
natural generalization of this algorithm for the case in which M is a tree is also k-competitive 
[CL3]. 


The proof that this algorithm is k-competitive makes use of a short, if somewhat mysterious, 
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potential function argument. We describe the proof for the case of two servers on a line, since 
it provides a good example of how such a proof proceeds. 

A potential function ® is a function defined on the set of possible configurations of the 
on-line and off-line servers. Thus, when any server moves, the value of ® may change. We 
denote the 7*® request in the request sequence o by o;, and the cost incurred by A and OPT 
on this single request by A(o;) and OPT(o;) respectively. Consider the following sequence of 
events: for each 7 = 1,2,..., first OPT serves the request o;, and then A serves o;. We will 
watch how the value of @ changes over the course of these movements of the servers. Suppose 


we can show the following three facts: 
1. ® is always non-negative. 


2. There is a constant c such that whenever OPT serves a request o;, ® goes up by at most 


c-OPT(o9;). 
3. When A serves a request o;, ® decreases by at least A(o;). 


Note that over the course of the whole request sequence, the total of the increases in ® is 
at most c)>; OPT(o;) = ¢-OPT(c), and similarly the sum of the decreases in ® is at least 
>>, A(o;) = A(o). Thus, if we let @) denote the value of ® on the initial configuration of the 


servers, then the fact that ® is always non-negative implies 


A(a) <c¢-OPT (a) + ®. 


That is, A is c-competitive. 

It is not difficult to show (see [MMS]) that we can assume OPT is a lazy algorithm: it 
always moves at most one server on each request, and does not move if it already covers the 
requested point. If x and y are points on the line, let cy denote the distance between them. For 
the case of two servers in M = R', let s,, sy denote the positions of algorithm .A’s servers, 01, 0» 
the positions of OPT’s servers, and Minin the minimum of 8,0; + 8202 and 8102 + 520, (i.e. it is 
the minimum-cost matching between the servers of A and OPT.) The potential function used 


in [CKPV] is an adaptation of one used by Coppersmith, Doyle, Raghavan, and Snir [CDRS]; 
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for our purposes, we can write it as 


® = 2Mnin + $1 $2. 


Clearly ® is always non-negative. When OPT moves a distance d, it cannot affect the second 
term, and it increases the first term by at most 2d. On a request in which A moves only one 
server, say a distance d, the first term will go down by 2d and the second will go up by d. On a 
request in which A moves both servers a distance d’, the first term cannot increase, while the 
second goes down by 2d’. (Verifying each of these facts is a fairly straightforward exercise.) As 
this shows that ® has the required three properties, the proof is complete. 

In this thesis, we will only be considering 2-server algorithms. Note that in this setting, 
there is no “k-server conjecture” — it has been known since the origin of the problem that 
there is a 2-competitive, on-line algorithm for two servers in a general metric space, and that 
no on-line algorithm can achieve a better ratio. (Other 2-competitive 2-server algorithms, based 
on quite different techniques, have been given by Chrobak and Larmore in [CL1, CL4].) Thus 
the emphasis here has been to give competitive algorithms that are as “simple” as possible — 
simple both in the effort required to analyze them, and in the amount of computation they 
must perform on each request. 

Specifically, all known optimal 2-server algorithms use more than a constant amount of 
time and space per request; indeed, when deciding which server to send to a point in M, they 
must perform computations based on the entire set of points constituting the previous requests. 
Given the nature of the problem, it might not have been unreasonable to have conjectured that 
this much computation is required to achieve any constant ratio. This is not the case, however. 
Irani and Rubinfeld [IR] gave the first competitive 2-server algorithm which performs only a 
constant amount of work on each request; its competitive ratio is known to be somewhere 
between 6 and 10. A “fast” 2-server algorithm with a competitive ratio of 4 was subsequently 
given by Chrobak and Larmore [CL2]. This issue of the computational resources required by 


server algorithms will be the subject of Chapter 5. 
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2.3. Other On-Line Problems 


We now give a brief description of some other areas which have been studied from the perspective 
of competitive analysis, before turning to robot navigation, the other main area considered in 
this thesis. By now, many on-line problems have been studied; the following is not meant to 
be comprehensive. 

As mentioned above, the surge of activity following [ST] led to some new results in on-line 
eraph coloring, already a topic of interest in the 1970’s. One basic question was the following: 
is there an on-line graph coloring algorithm with competitive ratio o(n)? Neither First Fit nor 
any of the other standard on-line coloring algorithms are better than O(n)-competitive, which 
is unfortunately also (to within constant factors) the competitive ratio of the algorithm which 


never re-uses a color. This question was answered in the affirmative in 1989 by Lovasz, Saks, 


n 
log* n 


and Trotter [LST], who gave an on-line algorithm with performance ratio O( ). In terms 
of lower bounds, Halldorsson and Szegedy [HS] showed that for every on-line algorithm A and 


integer k, there is a graph with at most &(2* — 1) vertices and chromatic number k on which A 


nr 


pu competitive on all 
og? n 


will use 2* —1 colors. Thus, no on-line algorithm can be better than Q( 
graphs. Related work on on-line graph coloring has been done by Irani [Ir] and Vishwanathan 
[Vi]. 

The problem of virtual circuit routing through a network is naturally on-line, and has been 
studied from that point of view in a number of recent papers. The model here is the following: 
one has a graph G which represents a high-speed network, and users at the nodes of the network 
who wish to transmit large amounts of data to each other. At each step, a pair of users requests 
that a connection be established between them. There are many variations, but in general a 
connection is specified by its two endpoints s and t, the amount of bandwidth requested, and 
the amount of time for which the connection must be in place. It is then the job of the on-line 
algorithm A to decide on a path from s to t on which the connection will be routed. Typically, 
A is trying to minimize the congestion in the network; thus, the cost incurred by A could be 
the maximum amount of bandwidth that passes through any link of G at any point in time. 

The above description is essentially of the model considered in [AAF PW], and in [AAPW] 
when the duration of the connection is unknown but re-routing is allowed. An alternative, but 


similar, scenario is the following: the network does not have sufficient bandwidth to handle all 
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requests; thus the on-line algorithm A must first decide whether to accept the connection, and 
if so, to route it through the network. This is the set-up considered in [AAP, ABFR]; a closely 
related problem is that of scheduling “intervals” on a line [LT]. One recurring theme in this 
work is that randomized algorithms often can achieve competitive ratios that are exponentially 
better than the lower bound for deterministic algorithms. 

A somewhat related problem was proposed by Imase and Waxman [IW], though it has been 
suggested independently in a number of sources. This is the on-line Steiner tree problem: one 
has a metric space M, and n points in M which must be joined together by a minimum-cost 
Steiner tree. The sequence is presented on-line, and each new point must be immediately 
connected to the existing tree. The on-line/off-line distinction here sort of resembles an “urban 
erowth” phenomenon — after the fact, it is much easier to find a reasonably short spanning 
network than when the points are appearing one at a time and must be hooked up immediately. 
[IW] shows that the greedy algorithm (when point x is requested, connect it to the existing 
network via the shortest possible path) is O(log n)-competitive in any metric space; they also 
present a somewhat contrived metric space M in which no on-line algorithm can be better than 
Q(log n)-competitive. Alon and Azar [AA] subsequently showed that in the Euclidean plane, 
no on-line Steiner tree algorithm can be better than Q(—2£"_)-competitive, leaving a slight gap 


loglogn 


between the upper and lower bounds in this case. 


2.4 Robot Navigation 


Maze-solving is an old obsession. The interest in such problems can be seen in the labyrinths of 
mythology and medieval architecture; and, the bridges of Konigsberg notwithstanding, it was 
one of the original motivations for study of graph theory. Maze-solving algorithms phrased in 
the terminology of graphs can be found in work of Tarry and Tremeaux that reaches back a 
century and more [Ore]. More recently, such questions have been addressed in the contexts of 
automata theory by Blum and Kozen [BK] and robotics by Lumelsky and Stepanov [LuS]. 
On-line navigation is concerned with questions somewhat more general than maze-solving: 
the environment will not always be nefarious, the goal not always to escape. Rather, we will be 
interested in a variety of tasks which must be performed in an environment for which the map, 


or some other crucial piece of information, is not known. Although the focus is theoretical, 
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the motivation comes from the ever-growing field of autonomous mobile robots, and we should 
frequently ask whether the problems we consider have some bearing on the real-world issues of 
robotics. 

Navigation in an unknown environment does not fit precisely into the framework described 
above, in that the mobile robot has a more interactive relationship with the “input” than in 
previous problems. Nevertheless, the problem is naturally on-line, in that the environment is 
presented in a piecemeal fashion, as the robot sees new parts of it, and the goal is (in general) 
to minimize the total distance traveled. The recent incorporation of navigation problems into 
the setting of competitive analysis comes mainly from the work of Baeza- Yates, Culberson, and 
Rawlins [BCR], and Papadimitriou and Yannakakis [PY]. 

[BCR] does not speak directly in terms of the competitive ratio, but its focus on the ratio of 
the robot’s distance traveled to the length of the shortest path is clear enough. The questions it 
considers are variations on the theme of searching for an object at an unknown location in the 
plane or on a line. These include, for example, the problem of searching for a point on a line 
or a collection of lines, and the well-known “lost-at-sea” problem, in which one must determine 
the optimal search pattern for finding a line (the shore) in the plane (the ocean) [Be, Is]. 

A fundamental technique introduced in [BCR] is that of spiral search — an uncomplicated 
idea which has proved to be a valuable building block in numerous on-line algorithms. A good 
example of its use is in the problem of a robot searching for a goal which lies on one of n roads 
meeting at the origin. We will assume here that the robot is constrained to move on these roads 
and cannot use vision; however, it will recognize the goal when it comes to it. Moreover, to 
prevent certain pathologies, we will assume that the goal is at least one unit of distance away 
from the origin. 

Clearly no on-line algorithm can be better than Q(n)-competitive for this problem: to 
construct a lower bound, we simulate the robot on a set of roads which does not contain a goal, 
and number the roads r,,...,7, according to when the robot first travels one unit of distance 
on each. We then simply place the goal one unit from the origin on road r,. The robot will 
travel a distance of 2n — 1, whereas the shortest path to the goal is 1. A simple form of the 
spiral search solution to this problem goes as follows: for d = 1,2,4,...,2/,..., explore each 


road out to a distance d and return to the origin. If the goal is found when d = 1, the robot has 
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traveled at most 2n — 1 times the length of the shortest path. Otherwise, suppose the goal was 


found when d = 2/ for some j > 1. Then the total distance traveled by the robot is at most 


So ae 

i=0 
Meanwhile, since the goal was not found when d was equal to 2/~!, the length of the shortest 
path is at least 2/-!. Thus, the robot travels at most 8n times the length of the shortest path. 
In [BCR] it is shown that by modifying these parameters a little, one achieve a competitive 
ratio of 2en — o(n), and that this is optimal up to low-order terms. 

As is the case with most of the basic notions discussed here, one can find the spiral search 
technique being used implicitly in a wide range of previous papers; its wide applicability is 
obviously due to its simplicity — keep doubling until you succeed — and the relative ease of 
analyzing the performance guarantee one gets — the sum of all that you’ve done in previous 
phases is only a constant fraction of what you do in the current phase. Recently, it has been 
used in many of the robot search papers we will discuss below [BCR, PY, BRaSc, KRT, KI], 
an abstract kind of navigation problem known as layered graph traversal [PY, FF KRRV], the 
design of hybrid algorithms [KMSY], and even the approximation of some NP-hard problems 
[BCCPRS, TWSY]. 


Figure 2-1: A rectangle packing 


The work of Papadimitriou and Yannakakis followed [BCR] and deals with navigation prob- 
lems that are somewhat less stylized. Specifically, they consider a robot with vision moving 


around in a plane filled with rectangular obstacles. The robot can move through any part of the 
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plane not filled in by an obstacle, and there is always room to move between adjacent obstacles 
(i.e. they cannot be “stuck together” to make more complex shapes). By vision, both here and 
in what follows, we mean the following: if the robot is at «, it can see the point y if the line 
segment TY does not meet any obstacle. Finally, let d(s,t) denote the length of the shortest 
obstacle-avoiding path between points s and t. 

Based on the above definitions, we will say that a rectangle packing is a collection of axis- 
aligned rectangles, each of which has at least unit thickness, such that there is always just 
enough room for the robot to pass between neighboring rectangles. [PY] deals with shortest- 
path problems in this setting: the robot starts at a point s and knows the coordinates of a point 
t that it wants to reach. The point ¢t is assumed to be in free space — i.e not in the middle of 
an obstacle. Let n denote the straight-line distance from s to ¢; note that this could be much 
less than d(s,t). When all the obstacles in the packing are squares, [PY] gives an algorithm 
with competitive ratio 4/26 ~ 1.70, and shows that no on-line algorithm can be better than 
3-competitive. For the case of an arbitrary rectangle packing, no bounded competitive ratio is 
possible: [PY] shows a lower bound of 2(,/n) on the best possible competitive ratio. 

Blum, Raghavan, and Schieber [BRaSc] address the same type of shortest-paths problems 
and give an O(,/n)-competitive algorithm for the s-¢ path problem in a rectangle packing; this 
matches the lower bound of [PY] up to constant factors. [BRaSc] also introduced the elegant 
“room problem” — consider a rectangle packing inside a large 2n x 2n square room (so there 
is space to move along the walls as well), such that the center of the room is in free space. The 
robot wants to start at one corner s of the room and reach the center ¢. The intriguing point 
here is that there is always a path from s to ¢ of length 2n — simply start at ¢, move west as 
far as possible, then south as far as possible, then west ... and so on — and this construction 
is of course on-line. But can the robot starting at s somehow reverse this process and find a 
path to t of length O(n)? [BRaSc] left this as an open question, providing only an algorithm 
to generate a path of length O(n2VvE"), the question was answered a year later by Bar-Eli, 
Berman, Fiat, and Yan [BBFY], who showed by a very complicated argument that no on-line 
algorithm can be guaranteed to find a path shorter than Q(n log 7), and gave an algorithm with 
a performance guarantee matching this up to constant factors. 


Less work has been done on geometric problems more in the spirit of [BCR] — that is, trying 
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to find a short path to a goal ¢ when neither the map of the environment nor the location of ¢ 
is known. This will be the subject of Chapter 3. 

Finally, there have been several papers on the related problem of exploring an unknown 
environment. Here, the goal is to traverse a path that sees all parts of the environment, both 
the obstacles and the free space. For the problem of exploring the interior (or exterior) of 
a simple rectilinear polygon, Deng, Kameda, and Papadimitriou [DKP] give a 2-competitive 
algorithm. The off-line version of this problem (what is the shortest path for exploring a given 
polygon P?) is perhaps better motivated as the “Shortest Watchman’s Route Problem”; Chin 
and Ntafos [CN] give a polynomial-time algorithm for computing such a route, and some of 
their definitions are used in the algorithm of [DKP]. Betke, Rivest, and Singh [BRiSi] consider 
compact exploration algorithms: the robot always knows the shortest path back to the origin 
whenever it reaches a new point. Their main result is a compact exploration algorithm for 
rectangle packings, which travels at most a constant times the total perimeter length of all 
rectangles in the scene (this is a lower bound for the optimal solution if, for example, we 
assume that the robot has very limited vision). 

For our purposes, exploration algorithms (on suitably restricted parts of the environment) 


will prove useful as subroutines in Chapters 3 and 4. 
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Chapter 3 


Searching an Unknown Polygon 


We are interested in geometric variants of the problem of searching with uncertainty. A natural 
problem in robotics is that of searching for a goal in an unknown polygonal region. For example, 
a robot with vision is placed at a starting point in a simple polygon, and it must traverse a 
path to some target point in the polygon. Both the location of the target and and the geometry 
of the polygon are unknown, but the robot will recognize the target when it sees it. 

As discussed in the previous chapter, on-line search algorithms have generally been devel- 
oped for situations in which the geometry is kept to a minimum — for example, the case of 
searching for a point on one of m concurrent rays [BCR, KRT, KMSY], a line in the plane, 
or a point in an integer lattice [BCR]. Moreover, the structure of the space to be searched is 
assumed to be known — only the location of the target is unknown. [BCR] writes, “...these 
problems are (very simple) models of searching in the real-world. It is very often the case that 
we do not know many of the parameters that are usually taken for granted in designing search 
algorithms.” 

Here, we provide an on-line algorithm for the general problem of searching for a target 
point at an unknown location in an unknown and arbitrary simple rectilinear polygon. Thus, 
the robot must adapt its search pattern as it sees more and more of the polygon. For a given 
rectilinear polygon P, we identify the number of essential cuts of P [CN, DKP] as a fundamental 
parameter in determining the best competitive ratio attainable in searching P — it is easy to 
cast the problem of searching m concurrent rays as a search problem in a polygon with m 


essential cuts, whence the [BCR] lower bound of 2em — o(m) on competitive ratio applies (e 


26 


is being used to denote the base of the natural logarithm here). Thus, the natural question 
is whether there is an algorithm which is O(m)-competitive for the problem of searching an 
arbitrary simple rectilinear polygon with m essential cuts. In Section 3.3, we present such an 
algorithm. 

The algorithm is based on the problem — first considered in [DKP] — of exploring a simple 
rectilinear polygon P, starting from and returning to a fixed point s in P; we use an exploration 
algorithm iteratively as the search proceeds. We show in Section 3.2 how an adaptation of the 
technique in [DKP] gives a randomized algorithm which is 5/4-competitive in the L, norm when 
sis any point inside P. 

Another direction in which one could investigate such search problems is to restrict the 
class of polygons and target points in such a way that a constant competitive ratio can be 
achieved for the search problem. Such an approach has been adopted by Klein in his work on 
streets. The papers [IK, kK] introduce the term street to define a class of general (not necessarily 
rectilinear) polygons with two distinguished points s and t, such that the two st boundary 
chains are mutually weakly visible (see below for an elaboration on this definition). In [Kg], 
Klein gives a 1 + 37-competitive (~ 5.71-competitive) algorithm for finding ¢ from s in an 
unknown street P. However, the algorithm and its analysis are quite involved. In Section 3.1 
we give a simple algorithm with a competitive ratio of at most \/4+ V8 (~ 2.61). Moreover 
when P is rectilinear, it achieves the optimal ratio of 2. We believe it would be interesting in 
general to find other natural classes of polygons that can be searched competitively. 

Finally, a word about the distance metrics used in this chapter. The distance between two 
points as measured in the L, and L» (Euclidean) metrics differs at most by a factor of V2: 
thus, an algorithm which is c-competitive in L, is e¥2-competitive in Ly. With the exception 
of the algorithm for traversing streets, which is analyzed directly in the Euclidean metric, we 
present our results in the conceptually neater framework of the £, metric. In view of the tight 


correspondence between L, and Lz, our final search algorithm is O(m)-competitive in both. 


3.1 Traversing an Unknown Street 


Let P be a simple polygon and s and ¢ two distinguished points on the boundary. The removal 


of s and ¢ would disconnect the boundary into two polygonal chains, L and R. We say that 
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P is a street [IK, K] if each point on the boundary of P can see some point on the opposite 
boundary chain. The goal is for a robot with vision to travel from s to ¢; neither the map of P 
nor the coordinates of ¢ are known. The cost incurred by the robot is the length of the path it 
generates, and its competitive ratio is taken with respect to the length of the shortest s-t path 


in P; distances are measured in the Euclidean metric. 


a) 
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Figure 3-1: Streets and their views 


Figure 3-1(a) can be completed to form a rectilinear street in which ¢ could be just around 
the corner from either X or Y [Kk]. The robot will incur the best worst-case performance if it 
moves directly to segment XY, then to ¢ (it will see ¢ when it reaches XY). This can be at most 
a factor of 2 longer than the shortest path. Curiously, this is the only known lower bound on 


the competitive ratio achievable for the problem. In [Kk], an algorithm with a competitive ratio 
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of at most 1+ or (~ 5.71) is presented. Below, we give an algorithm with competitive ratio at 
most \/4+ V8 (~ 2.61). 

The example of Figure 3-1(a) is central to the proof technique we develop in this section, 
and it highlights a principle that will appear repeatedly in what follows — on-line algorithms 
hate making decisions. Specifically, it is useful in many navigation problems to adopt a strategy 
that preserves the robot’s options for as long as possible. In the figure, suppose the robot at 
point s is moving towards segment XY, but it has not yet decided whether it ultimately wants 
to visit point X or Y. Let us define a polygonal path to be monotone if the z- and y-coordinates 
of the points on the path change from their initial to final values monotonically. Then the key 
observation is that in the £, metric, any monotone path between two points is a shortest path, 
so the robot can defer its decision (X or Y) until it reaches segment XY and still have the 
option of traveling optimally to either point. The related fact for the Euclidean metric is that 
any monotone path between two points in the plane has length at most a factor of V2 times 
the straight-line distance between them. Thus, in this example, the robot can move to segment 
XY before making a decision and travel only /2 times too far (Euclidean distance) in the worst 
case. 

Let P be a street, and assume that the robot is currently located at a point x inside P. The 
robot maintains an extended view of P; this consists of all points on the boundary of P that it 
has seen so far. The robot’s extended view will typically look like the example of Figure 3-1(b). 
We define a cave C to be a connected chain of the boundary of P such that the robot has seen 
the endpoints of the chain but no other points of it. At some point p’ on the robot’s path, these 
two endpoints were on the same line of sight from p’; call the one closer to p’ the “mouth” of 
C’. Each cavemouth v is a reflex vertex of P; in the neighborhood of v, P lies either to the left 
or right of the ray p’v. We accordingly refer to v as being either a left or right cavemouth. 

At any given point in time, it is useful to picture the “forward” direction for the robot 
as being along the positive y-axis; so the negative x-axis lies immediately to its left and the 
positive z-axis immediately to its right. In keeping with this terminology, if u and wu’ are two 
vectors, each with non-negative y coordinate, we will say that wu is to the left of w’ if it forms a 
smaller angle with the negative z-axis. Assume that ¢ has not yet been seen, and the robot has 


maintained the invariant that the points in its extended view immediately to its left and right 
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belong to E and R respectively. In view of these assumptions, we assemble some facts about 
extended views of a street before presenting the algorithm itself. The first is a standard fact 


about shortest paths inside any simple polygon. 
Lemma 3.1 I/ft is contained in a cave C, then the shortest x-t path touches the mouth of C. 


Lemma 3.2 Let p be a point on boundary chain L (resp. R), and let V be the boundary chain 


sp of P contained in L (R). If the robot moves from s to p, it will have seen every point on WV. 


Proof. Every point on boundary chain V must be able to see some point on R; but all such 


lines of sight to R cross the robot’s path from s to p. Thus the robot has seen every point on 


v. 
Lemma 3.3 [fv is a left (right) cavemouth, it belongs to boundary chain L (R). 


Proof. Assume v is a left cavemouth, v € R, and v was seen from point p’. The chain 
determined by a clockwise scan of the boundary from v to s (taken from point p’) is entirely 
contained in Rk. Thus, if the robot were to walk directly from p’ to v, it would have seen all 
of this chain, by Lemma 3.2. But since v is a cavemouth, it would not have seen any point 
on the boundary of P just around the corner from v, which belongs to this chain; this is a 


contradiction. I 


Corollary 3.4 In the extended view, all left cavemouths lie to the left of all right cavemouths. 


If the extended view contains any left cavemouths, we define c; to be the rightmost one. The 
point c, is defined analogously for right cavemouths. If both ¢ and c, are defined, the chain 
between the far endpoints of their respective caves must be completely visible in the extended 
view; otherwise, it would contain an additional left or right cavemouth. Combining this with 


Lemma 3.1 and the fact that ¢ has not been seen, 


Lemma 3.5 The point t lies in the cave of either c, or c,. Consequently, the shortest path 


from x tot touches either cq or c,. 
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Let d(-,-) denote the length of the shortest path between two points in P. The shortest 
path from s to t, denoted by , , is a chain of line segments joined at reflex vertices of P. There 
is a natural order on the vertices of , , determined by traversing it from s to ¢. Our algorithm 
works iteratively, allowing the robot to move from a given vertex of , to a later one, with small 


“detour.” The following theorem provides the main inductive step. 


Theorem 3.6 Assume that the robot is currently located at a verter x € ,. Then it can move 


to a later verter x' € , while traveling at most (\/4 + V8)d(2, 2’). 


Proof. We present an (on-line) algorithm for doing this. Based on the robot’s extended view, 


there are four cases to consider. (See Figure 3-2.) 


Case 3 Case 4 


Figure 3-2: The algorithm at work 


Case 1. If t is visible, the robot moves directly to t. The distance traveled is d(a,t). 

Case 2. If c, (c;) is not defined (there are no right (left) cavemouths), then by Lemma 3.5, 
, passes directly through c;. Thus, the robot moves directly to ¢;, following , the whole way. 

Otherwise, both c and c, are visible. The robot chooses a direction of motion such that 
c; lies to its left and c, lies to its right. We view this as a coordinate system in which the 
robot is the origin and it is moving in the direction of the positive y-axis; thus, c; has negative 


x-coordinate and ¢, has positive z-coordinate. The robot moves in this direction, updating its 
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extended view and the points c;,c,, until one of of the above two cases applies, or one of the 
following two: 

Case 3. The point c (or respectively c,) “jumps” to the opposite side of the y-axis. At the 
moment when this happens, both ¢; and ¢, will lie on the same line of sight. If the robot moves 
in this direction until it hits the nearer one, it will once again be on a point x’ € ,, having 
followed a path from x to 2’ that was monotone with respect to the chosen coordinate system. 
Thus, it has traveled no more than V2 times the distance from x to 2’ along, . 

Case 4. If none of Cases 1, 2, or 3 applies, then there comes a point at which the robot’s 
line of sight to c (¢,) is parallel to the z-axis. At the moment when this happens, the robot 
is “confused”; it can no longer follow a path guaranteed to be monotone to either ¢ or c,. 
However, the robot can adopt the following approach to return to the shortest path. 

Let us translate the coordinate system so that the robot is again at the origin (so c; now 
lies on the negative x-axis). Since ¢ has not yet been seen, and since the second quadrant (i.e. 
{ut <0, y > 0}) is free of cavemouths, the robot can see all of the contiguous boundary chain X 
lying in this quadrant. Thus, X must be entirely contained in LE or R; the robot can return to 
, once it discovers which of these cases holds. Define Ly to be the portion of boundary chain 
between ¢ and the endpoint of X lying on the negative z-axis; define Rx to be the portion of 
boundary chain between c, and the endpoint of X lying on the positive y-axis. 

The robot begins moving in the direction of the vector (—1,1), updating its extended view 


and the points ¢;, c,, until it sees ¢ or one of the following events occurs: 
1. ¢ has the same x-coordinate as the robot, or c, has the same y-coordinate as the robot. 
2. One of the chains Ly or Rx becomes completely visible. 


If event (2) occurs first, then the robot will be able to move to the opposite cavemouth, thereby 
returning to ,. Suppose event (1) occurs first; assume for the sake of concreteness that c; has 
the same x-coordinate as the robot (the other case is strictly analogous). Then a point just 
around the corner from c; can only see points lying on X U Ly; this implies that X C R, and 
so the robot can return to , by moving to ¢;. 

We must now bound how far the robot travels by implementing this strategy. Let r denote 


the distance traveled by the robot prior to becoming confused. First consider the case in which 
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event (1) occurs, and c, has the same y-coordinate as the robot. Suppose that the coordinates 
of c, are (v, u); then the robot travels r+u(1+V2)+0, while we have d(x, 2’) > /(r + uy? + 0. 


Thus, the worst-case ratio incurred by the robot is bounded by 


rtu(l+V2)+v 
(r+uy4+v? 


A somewhat lengthy but straightforward argument shows that this expression attains its max- 
imum when r = 0 and w= v(1+ V2), with a value of \/4+ V8. 

Now suppose event (1) occurs, and c has the same z-coordinate as the robot. If the 
coordinates of ¢; are (—u, —v), then the robot travels r+ u(1+/2)+v. Assume that when the 
robot first became confused, the coordinates of c; were (—u,,0) (recall that this point lies on 


the negative z-axis). Let w2 = wu — u,; then we have 


d(a,2x') > /r? + ud + /0? +08 


att < max(4, +), the worst-case ratio is upper-bounded by the 


In view of the simple inequality 


maximum value of 
r+u(1+ V2) 
As above, this is maximized by taking u, = r(1+ V2), with a value of \/4+ V8. 

The case in which event (2) occurs first is similar. Suppose that all of Lx becomes visible 
and the goal has not been seen (the other case is analogous). Then we can set 2’ to be the 
current value of c,. Suppose that the coordinates of c, are (v, wu); then since event (1) did not 
occur, the robot’s y-coordinate is no more than uw. Thus it can get to c, having traveled at most 
r+u(1+ V2) +, while again d(x,2’) > /@ + uj? $0?. 

Finally, we should note that it is possible for the robot’s motion to be stopped by the 
boundary of P. As in Lemma 3.2, however, it will have seen the entirety of one of the caves 
associated with ¢; or c, by the time this happens, so it can return to , ; the preceding analysis 


is not affected. m 


Corollary 3.7 For any street P, repeatedly applying the above algorithm produces a path from 


s tot that is at most \/4+ V8 times as long as the shortest s-t path in P. 
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In the case in which P is rectilinear, the above algorithm can be implemented so that the 
robot’s direction of motion is always along one of the coordinate axes. Then the key observation 
is that Case 4 cannot occur in a rectilinear street (since all angles are right angles, either a 
point just around the corner from ¢; could not see R, or a point just around the corner from c, 


could not see £). Thus we can show 

Theorem 3.8 When P is rectilinear, the above algorithm is \/2-competitive, and this is opti- 
mal. 

3.2 Exploring a Rectilinear Polygon 


First, we present some basic definitions of [CN, DKP] on the structure of rectilinear polygons. 


In this and the remaining sections, distances will be measured in the LE, metric. 


essential cut 


horizon 


essential cut 


Polygon Pm 


Figure 3-3: Some simple rectilinear polygons 


Let P be a simple rectilinear polygon and s a distinguished point in P. An edge e of P is 


contained in a line £; we say that an extended edge is a line segment é€ C € in P which shares one 
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endpoint with e, and whose other endpoint is also on the boundary of P. Each edge e induces 
at most two extended edges. Also, note that an extended edge is either horizontal or vertical, 
depending on the orientation of its associated edge e. Call € a horizon if there is no path from s 
to e that does not cross é. We can define a partial order on horizons as follows: if h and h’ are 
horizons, then h < h’ (h dominates h’) if any path from s to h’ must cross h, or if h = h’. The 
horizons of P which are maximal are called essential cuts; for an essential cut h, it is possible 
to start at s and follow a path which crosses every horizon except h. See Figure 3-3. 

A special case of the following lemma is given in [DKP]; it is the underlying reason for the 


success of “greedy” exploration algorithms in rectilinear polygons. 


Lemma 3.9 Let a,,...,Q, be a set of horizontal and vertical segments in P, and v a point 
in P. The (L,) shortest path beginning at v and touching the a; in order is generated by the 


greedy algorithm, which, from segment a;, always chooses the shortest path to aj;41. 


The problem we are considering here is that of traversing an exploration route in P: a 
path, starting and ending at s, such that every point of P can be seen from some point on 
the path. The robot does not know the map of P in advance. As noted in [CN, DKP] (see 
also the discussion above), a closed path through s has this property if and only if it touches 
all essential cuts in P. In [CN, DKP], it is observed that since any exploration route can be 
traversed (off-line) without self-crossings, the shortest exploration route will touch the essential 
cuts in clockwise order. Moreover, by Lemma 3.9, it will touch the cuts in this order using the 
greedy algorithm. 

Consider the case in which the point s lies on the boundary of P, between the endpoints of 
essential cuts e and e’. The on-line algorithm given in [DKP] is essentially a greedy strategy 
which crosses each upcoming horizon as quickly as possible; it is shown in [DKP] that it will 
traverse the greedy path which touches the essential cuts in clockwise order, beginning with 
e. Consequently, this algorithm finds the optimal exploration route on-line; it is 1-competitive 
when s lies on the boundary of P. 

When s does not lie on the boundary, the choice of which essential cut to start with becomes 


crucial, and the robot does not have enough information to make this choice. 


Proposition 3.10 No deterministic algorithm for exploring a simple rectilinear polygon can 


be better than 5/4-competitive. 
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Cc D 


Add small ” caves” at 


three of the four points 
A,B, ©, D 


Figure 3-4: Lower bound construction 


Proof. Consider Figure 3-4. All the long edges of the polygon P have length 2, the short 
edges have some length ¢ much less than 2, and s is at the center. A robot exploring P crosses 
either the upper or lower horizon first; assume the former case. At this point, it will see two 
tiny “caves” at points A and B, both of which must be visited. Assume that it visits A before 
visiting B or crossing the lower horizon (other cases are similar). 

We now add an extra cave at C but not at D. Even if the robot now had the map of P, 
it would have to travel a distance of 8 to visit the caves at B and C and return to s. It has 
traveled a distance of 2 to reach A; thus its total distance is 10. On the other hand, the greedy 
exploration route which visits C’ first travels a distance of 8. 

The algorithm given in [DKP] is 2-competitive when s is an arbitrary starting point in 
P, and this is the best known deterministic ratio. In the remainder of this section, we give 
a simple randomized algorithm whose performance matches the deterministic lower bound of 


Proposition 3.10. 


Theorem 3.11 There is 5/4-competitive randomized exploration algorithm when s is an arbi- 


trary starting point in P. 


Proof. Consider first the following construction. If the robot standing at s were to imagine 
a thin “needle” of boundary extending from the real boundary of P to s, it would then be 
on the boundary of this new polygon and could explore optimally. If we restrict ourselves to 


horizontal or vertical segments, then there are four possible needles that can be inserted in P. 
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See Figure 3-5. 


Inserting needles in the polygon 


The exploration route E avoiding a needle 


Figure 3-5: Polygon with needles 


Let L(-) denote the length of a path in P, & denote the optimal exploration route in P, 
and P; denote polygon P with the 2*® needle inserted, i = 1,2,3,4. Finally, we denote by 7; the 
(optimal) exploration route generated by the robot starting from s in P; (s is on the boundary 
of each P;). Of course, we are not really interested in the performance of 7; in P;; we must 
show that 7; is also not far from optimal in the original polygon P. Set d; = L(T;) — L(£). 

We claim that ae d; < L(F); the proof is as follows. Since F visits the essential cuts of 
P in clockwise order, it meets each needle in at most one point. FE can be traversed so as to 
avoid the 7" needle (it takes a detour through s); let us denote this longer route by E;. Since 
FE; is an exploration route for P; and 7; is optimal in this polygon, we have L(T;) < L(E;). 

Let d, = L(£;)— L(£). Consider the four points at which F hits the needles (some of these 
points may be s); connect these by shortest paths to form a closed path T. T is not necessarily 


an exploration route for P, but we have L(T) < L(E£) and L(T) = SY, dj. Since d; < di; for 
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each 7, ; ; 
Sidi < Sod, < L(E). 
i=1 i=1 


Thus, if the robot chooses any needle, the exploration route 7; it generates will have length 
at most 2/(#) (and simple examples show that, standing at s, there is no way to choose a needle 
guaranteeing a performance better than this). However, the expected value of the quantity d; 
is bounded by L(F)/4, so if the robot chooses one of the four needles uniformly at random, the 


expected length of the exploration route it generates is at most $Z(F). Wl 


3.3 Searching a Rectilinear Polygon 


Using the exploration algorithm of the preceding section, we can give an O(m)-competitive 
algorithm for the general search problem described at the beginning of this chapter. A point ¢ 
is hidden in a simple rectilinear polygon P with m essential cuts; however, the point ¢ can be 
recognized when it is first seen. A robot starting at s, and without knowledge of the map of P, 
must travel to t; its distance traveled is compared to d(s,t), the length of the shortest s-t path. 
As mentioned in the introduction, O(m) is the best bound possible on the competitive ratio: if 
we take the polygon P,, of Figure 3-3, make the m “arms” extremely long, and introduce tiny 
bends to limit visibility, then the [BCR] lower bound for searching m concurrent rays applies 
— no deterministic algorithm can be better than (2em — o(m))-competitive. (An Q(m) lower 
bound clearly holds for randomized algorithms as well, with a somewhat smaller constant.) 
We first assemble some basic lemmas that will be useful in analyzing the exploration algo- 


rithm. Because P is a simple polygon, we have the following fact. 


Lemma 3.12 Leth and h' be horizons with the same orientation such that for some point v in 
P, every path from s to v must cross both h and h’. Then h and h' are comparable with respect 


tox (hx<h' orh’ <h). 


Based on this lemma, we can represent the partial order < restricted to the horizontal segments 
by a directed tree 7), in which the root is the point s and the other nodes are horizontal horizons. 


For vertical horizons, there is the analogous representation as a tree 7). 


Lemma 3.13 Each of T,, and T, has at most m leaves. 
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Proof. A leaf of 7), corresponds either to a horizontal essential cut, or a horizon A that 
dominates some vertical essential cut h’. In the latter case, no other leaf of 7} can dominate 
h’, by Lemma 3.12, so A can be uniquely charged to h’. The same analysis holds for T,, giving 
us the stated bound. Ii 


Lemma 3.14 Let v be a point in P such that s cannot see v. Then there is some horizon h 
in P that separates s from v, such that for any path from s to h, the point v can be seen from 


some point on this path. (I.e. no matter how the robot gets to h, it will have seen v.) 


Proof. By analogy with the construction for streets, consider the “view” of P from point v. 
Define a cave, as before, to be a connected chain of the boundary of P such that v can see the 
endpoints but no other points of the chain. Since s cannot see v, s must lie in some cave C. 
Let wu be the cavemouth of C; then there is a horizon with endpoint u such that by the time 


the robot reaches this horizon, it will have crossed the ray vu and seen v. 


Lemma 3.15 Suppose the robot is moving perpendicularly towards an extended edge e, and no 
other extended edge of the same orientation lies between the robot and e. Then the robot can 


see both endpoints of e. 


Proof. Suppose the robot could not see the right endpoint of e; let 2 be the rightmost point 
on e that it can see. The line of sight from the robot to « must meet a reflex vertex of P; it is 
easily verified that this is the endpoint of a parallel extended edge e’ lying between the robot 
and ec. il 

Finally, the following lemma is the key to designing the search algorithm; it is also an 
interesting fact in itself. As before, let d(u,v) denote the length of an L£, shortest u-v path in 
P. 


Lemma 3.16 Let P be a simple polygon (not necessarily rectilinear), and consider a robot 
traversing some path in P. If points u and v are both visible from this path, then the robot can 


determine d(u,v) without seeing the rest of P. 


Proof. In fact, it can compute a shortest path between wu and v. Since we are dealing with the 


L, metric, the shortest u-v path in P will not generally be unique. However, some shortest u-v 
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path is polygonal (consists of a finite number of line segments). Consider the extended view of 
P that the robot has generated. By definition, the line segment joining the two endpoints of 
a cave in this view is completely contained in P; let us call it a “pseudo-edge.” Let P’ denote 
the truncated polygon whose boundary consists of the edges and pseudo-edges of the extended 
view of P. Thus, the robot has seen all of the boundary of P’. 

We claim that there is a shortest u-v path in P that does not leave P’. The result will 
follow since the robot can obviously compute a shortest u-v path in P’. Let T be a polygonal 
shortest u-v path in P, which may enter a cave of the extended view, crossing pseudo-edge e 
at the point x. Since v lies in P’, the path must re-enter P’; let us say that it next does so by 
crossing pseudo-edge e’ at point y. Since P is a simple polygon, e = e’. Thus we can form a 
new u-v path which goes directly from x to y when T enters this cave; this operation does not 
increase the length. Proceeding in this way, we eliminate the (finitely many) places at which T’ 


enters a cave, producing a u-v path 7’ in P’ with L(T’) < L(T). Of 


Theorem 3.17 There is an algorithm for searching simple rectilinear polygons, which is O(m)- 


competitive on the class of such polygons with m essential cuts. 


Proof. For any 6 > 0, let P® denote the rectilinear polygon obtained by “truncating” P at 
horizons that are more than 6 away from s. There is some small 6) such that all points within 6 
of s are visible from s. By rescaling, we can assume that 69 = 1. For fixed values of 6, the robot 
will simulate the algorithm of [DKP] in P®, as follows. The [DKP] algorithm has the property 
that at all times, the robot is moving along the coordinate axes, perpendicularly towards an 
extended edge, and it maintains this direction until it reaches the closest such extended edge 
e. By Lemma 3.15, it can see both of the endpoints of the edge e; thus, by Lemma 3.16, it can 
tell whether e lies in P® or not. If e lies in P’, the robot moves towards it, as in the [DKP] 
algorithm; otherwise, e is treated as a wall in the simulation. In this way, the exploration route 
generated by the simulation is the same as the route that the [DKP] algorithm generates in P’. 

Initially, 6 is set to 1. Whenever ¢ is first seen, the robot moves directly to it; if the robot 
explores P’ without seeing ¢, then 6 is doubled and the next iteration begins. 

Recall the partial orders 7, and T, on the horizontal and vertical horizons of P. In P%*, 


these naturally restrict to partial orders T? and T’, which are obtained simply by deleting all 
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nodes corresponding to horizons more than 6 away from s. Both T? and T° are still directed 
trees rooted at s, and by Lemma 3.13, each has at most m leaves. Since each essential cut of 
P® is a leaf of one of these trees, P’ has at most 2m essential cuts. Lemma 3.14 implies that 
one way to explore P’ would be to travel to each of its essential cuts individually; this would 
require a total distance of at most 2m(26) = 4mé. Since the robot is following a 2-competitive 
algorithm, it travels at most 8mé in exploring P*. 

Given this, we can complete the proof as follows. If the robot sees t when 6 = 1, then it 
can travel to it optimally. Otherwise, assume it first sees t in P’, with 6 = 2°. Since it did not 
see t when exploring P?"""), we have by Lemma 3.14 that d(s,t) > 2*71. 

Now, we must bound how far the robot has traveled before reaching ¢. Until it sees ¢, it has 
traveled at most . 


S$" 8m2! < 8m(2"*") < 32md(s,t). 


j=o 
Since 6 = 2" < 2d(s,t), it is easily verified that traveling to t can require no more than an 


additional 3d(s,t). Thus the robot’s path is O(m) times d(s,t), as required. I 


Al 


Chapter 4 


The Robot Localization Problem 


A fundamental task for an autonomous mobile robot is that of localization — determining its 
location in a known environment [Cox, TPBHSS, Wang]. This is also a problem well-known in 
everyday life, where it can be surprisingly easy to become lost even with a compass and some 
knowledge of the terrain — and where it is sometimes crucial whether or not the map one is 
using bears a little mark with the words, “You are here.” 

Thus we are dealing with a robot at an unknown location in an environment for which it 
does have a map. For our purposes, the environment € that we consider will generally be the 
interior of a large polygon filled with a finite number of polygonal obstacles. We also assume € 
is connected, so that there is an obstacle-avoiding path between any two points of €. A robot — 
assumed throughout to possess vision, a map of €, and knowledge of its orientation — “wakes 
up” at some point in €; its goal is to determine where it is. Since we are assuming the robot 
knows its orientation, we will say, using somewhat non-standard terminology, that two regions 
are congruent if one can be obtained from the other by a translation (as opposed to a rigid 
motion). 

The set of all points the robot can see from its current position forms a star-shaped visibility 
polygon P. If there is only one point in € from which the visibility polygon is congruent to 
P, then the robot can immediately determine uniquely where it is. Otherwise — if it is in 
a highly self-repeating environment such as a typical large building — there are a number of 
different places the robot could be. Guibas, Motwani, and Raghavan [GMR] considered the 


question of enumerating all possible locations for the robot, given a visibility polygon P, when 
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the environment € is itself a simple polygon. They gave a “single-shot” algorithm which runs in 
time O(|P||E|), as well as more complicated data structures for answering “localization queries.” 

In a strong sense, though, enumerating the possible locations is only a first step towards 
solving the localization problem. For if the environment is connected and robot begins moving 
around in €, building up larger and larger partial maps P’ D> P of its “neighborhood,” there 
will come a time when there is only one point in all of € at which one can place a partial map 
congruent to P’; the robot has then determined exactly where it is. (We will say that it has 
localized.) 

One way to see that the robot will eventually localize is to note the straightforward fact that 
if it explores all of €, then it can determine where it started from. But this is a crude solution; 
one would like to design an algorithm which causes the robot to travel as little as possible. 
From this point of view, a good localization algorithm is constantly adapting its search based 
on the set of possible locations of the robot, and their position in the map. We can therefore 
phrase the following question: how should the robot move in € so as to determine its location 


efficiently? 


Figure 4-1: The infinite fence problem 


Although the question is quite precise — we can choose to measure efficiency in terms of 
total distance traveled — it is not immediately clear that it can have an interesting answer. 
Indeed, if we consider the polygon of Figure 4-1, a robot starting from the middle has no choice 
but to walk to one of the ends before it can determine where it is; and this is only a factor of 
two better than the trivial approach of exploring the whole environment. So in what sense can 
we say that one algorithm is significantly more efficient than another? 

We argue here that this is an ideal setting in which to use the competitive ratio. Specifically, 
in the polygon of Figure 4-1, the key point to observe is that the robot could not have done 
any better. Even if we had an omniscient robot that knew where it was, it would have to travel 


half the diameter of the polygon before being able to prove that it started from the middle. In 
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this sense, localization becomes an on-line problem: we define a localizing path to be any path 
followed by a robot, at the end of which it can determine uniquely where it is, and compare 
the distance an on-line robot travels before localizing to the length of the shortest possible 
localizing path. The ratio of these two distances is our performance measure for a localization 
algorithm; the worst-case value of this ratio, over all starting points in €, is its competitive 
ratio. 


We find the search for good localization algorithms interesting for two fundamental reasons: 
1. Localization is a problem that arises in numerous practical robotics situations. 


2. The use of the map in helping to guide the robot’s search introduces some complica- 
tions that are not generally found in on-line navigation problems. In this case, it means 
that straightforward on-line techniques such as spiral search are no longer asymptotically 


optimal. 


Applications in Robotics 


The localization problem has been considered in a wide variety of contexts in the robotics 
literature. One application is in the design of robot vehicles that must perform a certain task 
repeatedly in the same environment. Here, localization is used to determine the starting location 
at the beginning of the task, and to maintain positioning information over time [Cox, Dr, Wang]. 
A similar use of localization is in analyzing aerial photographs to determine the location from 
which they were taken [YD]. 

Another major situation in which localization is used is in the design of autonomous ex- 
ploration vehicles, such as the current prototypes for Mars rovers [MAWM, SN]. For example, 
[MAWM] discusses strategies by which a mobile robot can determine its location after a short 
period of “reconnaissance”: in the model considered there, a rough global map of the Mar- 
tian terrain is known; the exploring robot relays local information back to Earth, where an 
obstacle-avoiding path is then planned. 

In a related vein, [TPBHSS] draws a distinction between localization algorithms for “update” 
problems, in which there is some rough estimate of the robot’s starting location, and “drop-off” 


problems, in which there is no initial information. In this terminology, we will be considering 
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the latter type of problem. Again, [TPBHSS] describes the role of “reconnaissance” in the 


localization problem, without providing any concrete algorithms. 


Theoretical Background 


The only previous theoretical treatment of the localization problem is the paper of Guibas, 
Motwani, and Raghavan [GMR], and as mentioned above, it deals only with the “static” version 
of the question. There has been no previous work on this problem from the perspective of 
competitive analysis, though there is a straightforward (and non-optimal) algorithm which is 
not difficult to describe; we will discuss this shortly. 

In what follows, we deal with fairly simple types of environments in order to emphasize the 
combinatorial aspects of the problem rather than the geometric ones. Thus, we consider the 


following two types of environments. 


e Bounded-degree trees embedded in R4. That is, the vertices are realized by points in R4 
and the edges by line segments, and the robot is constrained to move on the edges and 


vertices. If 7’ is an embedded tree, we will use n to denote the number of leaves it has. 


e Rectangle packings in the plane, comprised of n rectangles [PY, BRaSc, BBFY, BRiSi]. 
Recall from Chapter 2 that we assume all rectangles have at least unit thickness, and that 


there is always just enough room for the robot to move between neighboring rectangles. 


Arguably the most natural way to design a localization algorithm for either of these environ- 
ments is the spiral search technique of Baeza-Yates, Culberson, and Rawlins [BCR]. For our 
purposes here, spiral search could be implemented by having the robot iteratively explore all 
points of the environment within distance 1,2,4,...,2’,..., until it knows where it is; the 
resulting algorithm will be O(n)-competitive. 

In navigation problems for which no information about the map is known (such as those of 
the previous chapters), it is frequently not difficult to show that spiral search is optimal up to 
constant factors. Indeed a great deal of work is often done to determine what these constant 
factors are [BCR, KRT, KMSY]. But the fact remains that no algorithm can be better than 


Q(n)-competitive for many of these problems. In some sense, this is not surprising; when the 
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robot knows nothing at all, it is difficult to do much besides a brute-force search out to larger 
and larger radii. 

What we find interesting about the localization problem is that knowledge of the map enables 
the robot to begin focusing its search as it sees more and more of its surroundings; in particular, 
this will lead to a localization algorithm that is o(n)-competitive,’ improving asymptotically 
on spiral search. At a more general level, we are interested in on-line navigation problems in 
which, as is common in real applications, the robot has some limited information about its 
environment. Such problems tend to contain interesting structure that can be exploited when 
designing algorithms, and often provide insight into the value of a map in performing navigation 
tasks. Another example of this is the k-trip shortest-path problem considered by Blum and 
Chalasani [BC]. Here the robot wishes to make k trips between points s and t while minimizing 
the average time per trip; thus it can make use of partial maps of the scene on later trips. For 
a further perspective on the value of different types of information in performing navigation 
tasks, see the work of Donald on “information invariants” [Don]. 

The localization algorithm we present is quite natural and simple to state; the difficulty lies 
in analyzing the competitive ratio. The algorithm performs an initial period of spiral search on 
a local area which is sufficiently restricted to keep the competitive ratio from getting too large. 
At some point during this search, the robot is able to identify one or more “critical directions” 
in the environment; by searching only in these directions, the number of possible locations can 
be eliminated much more quickly. The algorithm then performs a final “clean-up” stage, in 
which the remaining possibilities are eliminated in an iterative fashion. In the case of trees, this 
algorithm is O(n?/*)-competitive. The analysis for rectangular obstacles appears to be quite a 
bit more complicated; however, we are able to show a competitive ratio of O(n ween) = o(n). 
In contrast, the strongest lower bound we can show is Q(,/n), in both types of environments. 
Closing this gap remains an interesting open question. 

In independent work, Dudek, Romanik, and Whitesides [DRW] give a geometric implemen- 
tation of the “shortest-distinguishing-paths” algorithm — which appears as Step 3 in the main 
algorithm of this chapter — when the environment is a simple polygon. Their algorithm is 


O(k)-competitive when the robot initially cannot distinguish among & possible starting points 
Recall that f(n) is o(g(n)) when limnoo f(n)/g(n) = 0. 
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(this number may be as large as Q(n), where n is the number of vertices of the polygon). 


4.1 Lower Bounds and Other Examples 


We give two examples of trees embedded in the plane, both of which show that no localization 
algorithm can be better than 2(,/7)-competitive on trees. In this and the following section, 
we assume that a robot on a tree cannot make use of vision — all it knows is the orientation 
of all edges incident to the vertex it currently occupies. Note that this can be simulated in a 
simple rectilinear polygon by introducing tiny bends everywhere to limit visibility; thus, any 


lower bound for trees immediately carries over to the class of simple polygons. 


Figure 4-2: The first lower bound 


By a branch-vertex, we mean a vertex with degree at least 3. The tree 7’ in Figure 4-2 
consists of p blocks, each of which has p branch-vertices and long “passages” to its neighboring 
blocks. Thus the number of leaves in T is n = 2(2p — 1) + (p — 2)(2p — 2) = 2p? — 2p 4+ 2. 
The robot cannot localize until it has found one of the long passages; by placing it in the block 
whose passage corresponds to one of the branches it will search last, we can force the robot to 
travel Q(p) = Q(/n) times too far. (In fact, there is an algorithm which is O(,/n)-competitive 
on this tree; this is not difficult to see.) 

We now turn to a second Q(,/n) lower bound for trees, which is somewhat more subtle to 
analyze and also has the advantage of extending to rectangular obstacles. The tree 7), is shown 
in Figure 4-3; it consists of a path II of length n, with a path growing north from each vertex. 
All paths but the middle one have length h; the middle one has length h + 1 (the value of h 
will be fixed later). Observe that 7), indeed has n leaves; and the key observation is that the 


robot cannot localize until it has reached one end of II, or traversed the middle path. 


AT 


Figure 4-3: The second lower bound 


First we argue that if h = |/n|, then no algorithm on 7), can be better than Q(./n)- 
competitive. Let v,,...,v, be the vertices of II that lie between A and 2h steps to the west of 
the entrance to the middle path. We place the robot at one of these vertices, such that it will 
traverse all the A other possible north-leading paths before finding the middle one. Thus, it 
will travel at least h? > n —2,/n before finding the end of the middle path; to reach either end 
of II, it must travel at least n/2 — 2,/n. Either way this is a distance of Q(n). Meanwhile, the 
off-line adversary need walk a distance of at most 3,/n to reach the end of the middle path. 

Conversely, however, one cannot improve this lower bound by varying the value of h. Assume 
the robot starts somewhere on the path I, a distance d from the entrance to the middle path. 
(if it starts elsewhere, the arguments are essentially the same). If h < ,/n, then the robot can 
apply the two-way spiral search algorithm, traveling all the way up each new path it encounters. 
This algorithm is O(h)-competitive. And if h > \/n, then the adversary must travel at least a 
distance of \/n to localize, so the robot can ignore all the north-leading paths and simply apply 
the two-way spiral search algorithm to the path IH until the nearer end is reached, traveling a 
distance O(n). 

This latter case suggests some of the difficulty inherent in obtaining a o(n)-competitive 
algorithm — when h = n, the full spiral search algorithm is Q(n)-competitive; it is necessary 
to identify the path II as somehow being more important than the profusion of paths leading 
north. We will see a general technique for doing this in the next section. 

In closing this section, we simply note that it is easy to construct the second lower bound 
out of O(n) rectangles; see Figure 4-4. We use very long rectangles growing north and south 
so that it will take the robot too long to try completely leaving the middle path II. Each of 


the paths growing north out of II is now simulated by two of these very long rectangles with a 
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Figure 4-4: Making the second lower bound work for rectangles 


shorter one in between; also II now bends slightly so that the robot cannot see all the way down 
it. Thus no algorithm can have a performance guarantee better than Q(./n) for environments 


of n rectangular obstacles. 


4.2 The Algorithm for Trees 


Recall that by a geometric tree J’ we mean a pair (V,/), where V is a finite point set in R4 
and F is a set of line segments whose endpoints all lie in V. The segments of F intersect only 
at points in V, and they do not induce any cycles. 

As before, vertices with degree greater than 2 in T will be called branch-vertices. It will 
turn out that the degree-2 vertices of 7’ are largely unimportant in the algorithm; thus we 
change our definition of 7 to an equivalent one without such vertices. Specifically, we will say 
that all the vertices of a geometric tree 7 are either leaves or branch-vertices, and edges are 
now polygonal paths between the vertices. Moreover, we assume for the sake of simplicity that 
T has bounded degree; namely, for some absolute constant A, at most A segments in F are 
incident to any given vertex in V. So we again change notation slightly by saying that 7 has n 
branch-vertices, and consequently has at most (A — 2)n +2 < An leaves (and at least n + 2). 

IfU CV, we use 7(U) to denote the subtree induced by U. When U = {x,y}, this is simply 
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the path from z to y; we denote its length (as a polygonal path under the Euclidean distance) 
by dr(a.y). Finally, recall that the robot is constrained to move on the vertices and edges of 
T, and can make no use of vision other than to know the orientation of all edges incident to its 
current location. 


Consider a geometric tree T with n branch-vertices. We wish to prove the following. 


Theorem 4.1 There is an algorithm which is O(n?/*)-competitive for the localization problem 


on geometric trees. 


To prevent various pathologies, we assume that all the points in V have rational coordinates, 
and that the minimum length of any edge in F is 1. (These assumptions can be avoided at the 


cost of more cumbersome definitions below.) 


ae T' 


o DCT 


Figure 4-5: Lining up 7” with 7 


We can assume without loss of generality that the robot begins at one of the vertices of 
T (ie. at a leaf or a branch-vertex), rather than in the middle of one of the edges. This 
is simply because the robot can initially perform two-way spiral search to reach the closest 
vertex, traveling no more than 9 times too far. Moreover, we can then assume that it in fact 
begins at a branch-vertex, since its only choice at a leaf is to move along the incident edge. 

At all times, the robot maintains a search region, some geometric tree 7’ which consists of 


everything it has seen so far. This means that it is in a part of 7’ which locally looks like 7”; 
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thus the basic computation the robot will be performing as it explores is that of “lining up” its 
current copy of 7” with various parts of 7, in the natural way. The robot maintains its current 
location @ on the map of 7", and a set D, of possible locations v that this might correspond to 
in T. (To keep the notation clear, we will use Greek letters to denote vertices in T’.) If v € Da, 
then we will use Y,,(7") to denote the subtree of 7 induced by rigidly placing T’ on T so that 
a is mapped to v. See Figure 4-5. 

Of course, the robot always does have a “genuine” location in 7; it simply does not know 
what this is (until it has localized). Notationally, it is sometimes useful to refer to this unknown 
location: when the robot is at a € T’, we will denote its true position in T by Z(a). (Note 
that Z(a) € Dy.) 


The following fact is immediate but will be used frequently. 


Lemma 4.2 For alla,@ in T’, Dg is equal to the set D. translated by the vector from a to p. 


In particular, |D.| = |Da\. 


As the robot performs its exploration, it remembers the branch-vertex in 7’ at which it first 
woke up; we will denote this vertex yo. In the course of the localization algorithm, the robot 


maintains four principal quantities of the search region. 


1. As we will see below, 7” is initially constructed using spiral search. Thus we maintain 
the current “search radius” r(7’); this is the distance out to which depth-first search is 


currently being performed, from the initial location yo in T’. 


2. The common value of |D.| (as in Lemma 4.2) will be denoted p(T’); this is simply the 


number of possible placements of 7’ in T. 
3. The number of branch-vertices of T’ will be denoted 6(7"). 


4. The quantity w(7”) is defined to be 


max Da NYo.(T')]. 


a€eT’ vEDa 


That is, if we pick a to be the “origin” of 7”, then we can place it on v so that it covers 


w(T") other origins. 
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The algorithm consists of three main steps, which are controlled by the following global 


structure: 


Initially 6(7”")=1, p(T’) =O(n), w(T’) =1 
While 6(7T’) < p(T’) and 6(T") < n/w(T’) 
Execute Step 1 
If p(T’) < b(T’) then 
Execute Step 3 


Else execute Step 2 followed by Step 3 


In the remainder of this section, we will describe each of the steps individually, then analyze 


the competitive ratio of the resulting algorithm. 


Step 1: Restricted Spiral Search 


The robot first wakes up at some initial location yo € 7’ and begins performing spiral search 
[BCR]. This can be described as follows: the robot starts at yo and performs successive depth- 
first searches so as to see all points within distance 2/ of Z(yo), for j = 0,1,2,.... 


The fundamental fact about spiral search is the following. 


Lemma 4.3 At the end of Step 1, the robot has traveled no more than 8Ab(T") times the length 


of the optimal solution. 


Proof. Suppose the final search radius was r = 2/. So the total distance traveled by the robot 
is at most 


j 

S > 2A0(T")- 2) < QP ADT"). 

7=0 
Since Step 1 did not terminate when the search radius was equal to 2/~1, any localizing path 
must travel a distance of at least 2’-' away from yo. The bound follows. Ii 
Step 2: Extending the Critical Path 


If P is a directed polygonal path and k > 1, we use P* to denote the path formed by joining 
together k copies of P in succession. We use P~' to denote P with the edges presented in 


the reverse order. Let U = {uj,...,u,} be a subset of the vertices of T. We will say that U 
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Figure 4-6: (a) A periodic path. (b) A comb tree. 


induces a periodic path if T(U) is a simple path and there is some polygonal path P such that 
T (uj, Uzi) = P™ for natural numbers m,,...,™m,-1. We will say that U induces a comb tree 
if T(U) consists of a periodic path on vertices {v,...,v,} distinct from U (the base), together 
with disjoint paths 7’(u;,v;) (the teeth), all of which are mutually congruent; the vertex v; will 
be called the support point of u;. (See Figure 4-6.) 

Recall that at the beginning of Step 2, we have an “origin” a of 7’ and a placement Y,,, of 


T’ which covers at least w(Z’) other origins in T. We will use W to denote this set 
Di Veale) 


of covered origins. 
Lemma 4.4 W induces either a periodic path or a comb tree in T (and hence also in T"). 


Proof. The proof is based on the following two claims, whose proofs are given in the Appendix 


of this chapter. 


Claim 4.5 Let T be a geometric tree, r,z,y,z € V. Suppose that T(r,x) is congruent to 
T(y,z), and T(r,y) is congruent to T(x,z). Then T(r,x,y,z) ts either a periodic path or a 


comb tree. 
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Claim 4.6 Let T be a geometric tree and U a subset of the vertices. Suppose that there is some 
« €U such that for all other y,z € U, T(a,y, 2) is either a periodic path or a comb tree. Then 


T(U) is a periodic path or a comb tree. 


Using these, we prove the lemma as follows. Recalling that W = D,NYq,,(T"), let us suppose 
that W = {w.,...,w,}, with w; = v. Consider any other w;,w,; with 1 < i<j <k. Since these 
belong to D,, there is a path congruent to T(w), w;) emanating from w;, and a path congruent 


to T(w, w;) emanating from w;. Note that these two paths have a common endpoint (at the 


point w, +(w; — w1)+(w; —w)). Let u denote this common endpoint; then wy, w;, w;, u satisfy 
the hypotheses of Claim 4.5. This in turn shows that W and the distinguished point wy, satisfy 
the hypotheses of Claim 4.6, and hence T(W) is either a periodic path or a comb tree. 

But suppose that T(W) is in fact a comb tree. Then by Lemma 4.2, there is a vertex a’ € T” 
such that D, contains the set W’ of support points of T(W) (which form a periodic path). In 


fact, if we let v' denote the support point of v, we see that W’ C Dar N Yay, So 
|D a q Yor u'| > |W | > w(T"). 


Thus, by using a’ instead of a, we obtain a set W’ (satisfying |W’| > w(T’)) which induces a 


periodic path. We state this as the following extension of the previous lemma. 
Lemma 4.7 W can be chosen so that it induces a periodic path in T (and in T’). 


Suppose 7(W) = T’(W) = P™ is such a periodic path, with origin a. The robot moves 
to a; it is now standing somewhere in the middle of a long periodic path, and can thus follow 
successive copies of P by moving in one direction (the forward direction), and successive copies 
of P~' by moving in the other (the backward direction). The robot moves in each of these 
directions until it determines the largest 7 and j for which it is possible to traverse P’ in the 
forward direction and P~! in the backward direction, starting from a. 


Locating a and v, and traversing P’ and P~/, constitutes Step 2. Note the following facts. 


Lemma 4.8 In performing Step 2, the robot travels no more than 4+ times the length 


8n 
w(T’)-1 


of the optimal solution. 
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Proof. Moving to a costs at most 2r(7’). 7’ contains P™ as a path, and m > w(T"), so the 


2r(T') 
w(T’)-1° 


length of P (and hence P~') is at most Also, each copy of P uses up an additional 


branch-vertex of T, so the robot will traverse at most n copies of P and P~'. Since each copy 


is traversed twice (the robot returns to a), it travels at most 2r(T’) + an. On the other 


hand, the length of the optimal solution is at least r(T’)/2, as argued above. Ii 
Lemma 4.9 At the end of Step 2, p(T’) < 2n/w'(1"). 


Proof. For v € D, and wu any other vertex of T', we will say that wu is P-covered (resp. P~'- 
covered) by v if by starting at » and following successive copies of P (resp. P~'), the robot can 
reach u. By considering the path P” in Y,,, we see that the total number of vertices that are 
P-covered or P~'-covered by each v € D, is at least w(T’) (counting v itself). Reversing the 
names of P and P~! if necessary, we can assume that the average number of vertices P-covered 
by a vertex v € D, is at least w(T’)/2. 

On the other hand, we claim that no vertex of D, P-covers any other. This is simply 
because it is well-defined for each v € D, how far one can move along 7 following the edge 
sequence of P”; thus if » P-covers v’, the robot can eliminate at least one of v or v’ from its 
set of possible locations. 

This in turn implies that no vertex wu € T is P-covered by more than one member of D, 
(if it were covered by two, the one farther from wu would cover the one closer to uw). Since the 
average number of vertices covered by a vertex v € D, is at least w(T’)/2, and each vertex is 


covered at most once, 


Dal = aT") < Say 


Step 3: Cleaning Up 


Once p(T’) has become sufficiently small, the robot can finish its task by brute force. Specifi- 
cally, assume that it is currently located at an origin ain 7’. For each pair of vertices v, v' € Da, 
define their shortest distinguishing path to be the shortest path Q such that it is possible to 
traverse @ starting from v but not from v’ (or vice versa). (This is the least one has to travel 


to tell v from v’.) 
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In Step 3, the robot iteratively applies the following strategy. Over all v,v’ € D,, it chooses 
the pair with the distinguishing path of minimum length. By following this path, the robot will 
be able to eliminate either v or v from D,, and perhaps both. Meanwhile, the optimal off-line 
algorithm must travel at least this far, since there is no way to eliminate even a single vertex 
from D, otherwise. The robot then returns to a and begins the next iteration. As there are at 


most p(7’) — 1 such iterations, we have proved 


Lemma 4.10 In Step 3, the robot travels no more than 2p(T’) — 2 times the length of the 


optimal solution. 


The Global Structure 


Finally, we give an absolute bound on the competitive ratio, using the lemmas above. Note first 
of all that the tree 7), of Figure 4-3, with h = n, shows that the algorithm which simply applies 
Step 1 until p(7’) < 6(7") and then switches to Step 3 is no better than O(n)-competitive. But 
introducing the option of Step 2 prevents the initial period of spiral search from going on for 


too long. The crucial fact is the following. 
Lemma 4.11 By the time b(T') exceeds n?/*, the robot will have stopped executing Step 1. 


Proof. For a € T’ and v € Dg, let wa,(T’) denote the cardinality of Da N Yo,,(1") — that is, 
the number of origins in D, covered if 7’ is placed so that a corresponds to v € T’. To prove 
the lemma, it is sufficient to show that if both b(7T’) and p(T") are greater than 7/3, then some 
Wa»(T") is at least ni/3. 
For a given v € 7, let 
Ey ={a:ve Dy} 


and e, = |£,|. Thus, 
Soe, = O(T")p(T"). 


vEV 
Now let us compute the sum $' of w,,(Z") over all pairs (a,v) such that v € Dy. Fix such 
a pair (a,v). First, it contributes once to wa,(7"). Also, for every other 8 € F,, there is 


some u € T such that v corresponds to # in Y,,(7"); thus, the pair (a, v) contributes once to 
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Wau(L’). Thus (a, v) contributes a total of e, to the sum $. Summing over all pairs, we obtain 


S- WaT") = S- e. 


Since 7’ has n branch-vertices, this value is minimized by setting each e, equal to 
1 1 
— » = —W(T’) p(T" 
7 > ey = (TT) 


and thus 
1 


1 
av= 25 -—)(T')? T')? = —b(T'Y? T!)?. 
Yan = Sm EMP P LY = THT AT 
By the pigeonhole principle, applied to the 6(7T”) choices of a and the p(7”) choices for v € a, 


there is some pair v,a@ for which 


as desired. fi 

First suppose the robot goes directly from Step 1 to Step 3. When this transition happens, 
b(T') < n?/? and p(T’) < n?/, so by Lemmas 4.3 and 4.10, it travels at most O(n7/3) times the 
length of the optimal solution. 

Otherwise, the robot goes from Step 1 to Step 2, the transition occurrring when 6(7”) and 
n/w(T") are at most n?/3. Thus by Lemmas 4.3 and 4.8, it travels at most O(n?/*) times too 
far in Steps 1 and 2. Now, Lemma 4.9 implies that it will begin Step 3 with p(T") < 2n7/3, so 
by Lemma 4.10, it travels only O(n?/*) times too far in Step 3 as well. As these are the only 


two cases, this completes the proof of Theorem 4.1. 


4.3 The Algorithm for Rectangles 


Again, to keep complications related to visibility to a minimum, we work with a rectangle 
packing [BRaSc, BBFY, BRiSi], as defined earlier. By a vertex of the environment, we will 
mean a corner of some rectangle. So in the spirit of the previous section, one could picture 


a planar graph embedded in the two-dimensional integer grid, all of whose bounded faces are 
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rectangles. 

The algorithm for the case of rectangular obstacles is very similar to the one for trees; the 
main difference is the lack of an analogue to Lemma 4.4 to provide the robot with an obvious 
critical path to explore. As a result, the transition from Step 1 to Step 2 cannot happen as 
early as in the algorithm of Section 4.2; rather, the robot waits until the spiral-search branching 
factor becomes too large, and then begins exploring several critical paths in succession before 
beginning Step 3. We elaborate below. 

Let A(n) = ae The set of n rectangles will be denoted R, and the current search 
region R’. b(R’) will now simply be a measure of the number of rectangles in R’ (including 


partial rectangles). Other notation is as before. The global structure of the algorithm is as 


follows. 


Initially b(Rk’)=1, p(R’) = O(n). 


While b(R’) < 74 


execute Step 1 
While p(B’) > <5 
execute Step 2 


Execute Step 3 


Step 1: Restricted Spiral Search 


Actually, this is not nearly as straightforward as it was for trees. We would like the robot to 
iteratively explore all parts of the environment within distance 1,2,4,... until b(R’) gets too 
large. The problem is that the robot must be careful to keep track of the distance to each 
point it encounters; that is, it must know a shortest path back to the origin yo for each point it 
reaches. Fortunately, there is a “compact search” subroutine due to Betke, Rivest, and Singh 
[BRiSi] which accomplishes just this. 

Thus Step 1 will proceed as follows. For successive values of 2’ (7 = 0,1,2,...) the robot 
explores all points within 2’ of Z(7yo). We implement stage j of this process using a simple 
modification of the compact search algorithm of [BRiSi] — the robot turns back whenever it is 


about to move more than 2/ from 7. 
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Lemma 4.12 At the end of Step 1, the robot has traveled O(b( R’)) times the length of the 


optimal solution. 


Proof. The main result of [BRiSi] is that the robot will travel at most 10 times the total 
length of all edges in the region searched. In stage 7, each rectangle (or partial rectangle) has 
perimeter at most 4-2’ and there are at most b(R’) rectangles; thus the robot travels at most 
406( R’)- 2. Summing over all stages, the distance traveled is at most 80r(R’)b( R’); meanwhile, 


the optimal solution has length at least r(.R’)/2. 


Step 2: Extending Multiple Critical Paths 


Let c(R’) denote the quantity 


min min  dp(v,v’). 
(oER') (v,v'EDa) 


That is, c(.R’) is the smallest distance between two vertices in R corresponding to the same 
vertex in R’. Let P denote the polygonal path from v to v’. As in Section 4.2, the robot, 
starting from v, tries to follow as many copies of P in succession as possible. After this, c(R’) 
and the values of v,v’ are updated, and the robot iterates. Step 2 comes to an end when p( R’) 
gets down to Xay: 

To bound the distance traveled in Step 2, we first prove a combinatorial lemma about trees 
with edge lengths. Let 7 be a tree with m vertices and maximum degree A, and let p bea 
length function on its edges. If v is a vertex of 7, Ba(v) will denote the set of all vertices within 


distance d of v and rad(r) will denote, as usual, the smallest d such that there exists a v with 


TC Bar). 


Lemma 4.13 There exists a vertex v* of 7 for which 


Proof. Let u be a vertex of 7 that realizes the radius; that is, Braa(r)(t) > 7. Forti = 


1,...,A(m)?, let 7; denote the set of all vertices in + whose distance from u is at most Xmy . 
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rad(r), and m,; = |7;|. Setting mo = 1, we know that 


A(m)* 


mM; 
I] ——=™ 


i=1 M1 


so by the pigeonhole principle there is some j for which “+ > log m. 


Consider cutting off the tree at distance waF -rad(T) from w, and let €,,...,¢, be the edges 
that cross this boundary. Since 7 has maximum degree A, and there are only m,; vertices in 
T;, we have s < Am,; now, since m;4, > m,;logm, one of the subtrees below some e, has at 


least log m/A vertices in 7;4; —7;. Thus we can let the vertex in the subtree below e;, which is 


closest to u be v* in the statement of the lemma. Bi 


Lemma 4.14 In Step 2, the robot travels at most Osta) times the length of the optimal 


solution. 


Proof. Consider building a shortest-paths tree, rooted at yo, on the vertices of the search 
region R’. (Note that we can build such a tree since we used a compact search algorithm in 
Step 1.) This tree has maximum degree four (since the obstacles are rectangles), so, applying 


Lemma 4.13, we see that there is some vertex a* with at least +(log n—log \(m)) vertices within 


a radius ae Now suppose that there are not two vertices v,v’ € D,« for which 
2r( RB’) 
d < . 4.1 
R(v, v ) = Mn)? ( ) 


Then we could pack into R a collection of disjoint balls of radius ir, each of which contains 


at most one member of D,« and at least 4(log n —log A(n)) vertices of R. But this would imply 


An An 
) < 


Dy < 
Da ~ log n —log A(n) ~ A(n) 


= p(R') 


and thus the robot would not execute Step 2 at all. 
Thus we have shown that as long as b(.R’) > x75 and p(R’) > xay there will be vertices 
v,v' € D,« satisfying Equation (4.1). As before, let P denote the polygonal path from v to v’, 


and suppose that the robot traverses P* beginning at Z(a*) but is not able to traverse P*t'. 


We will call this a short iteration if s < A(n), and a long iteration otherwise. 
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Observe that a short iteration eliminates at least one of v,v’ from D,+, so there are at most 


4r(R’) in 


2(F) so the robot travels at most 


n short iterations. Also, the length of P is at most Maye? Xin) 


one such iteration, for a total of Xin) -r( RB’). 


Now we claim that there can be at most one long iteration in Step 2. Indeed, at the end 


of such an iteration each v € D,» P-covers at least A(n) vertices, so by arguments strictly 


analogous to those in the proof of Lemma 4.9, there can be at most Xen) vertices in D,». Thus, 


Step 2 will come to an end after the first long iteration. Moreoever, s is always at most n, so 


the robot will travel at most NEN -r( R’) in this iteration. 
As the length of the optimal solution is at least r(.R’)/2, the bound follows. li 
Step 3 is implemented just as before. Thus, by Lemmas 4.12, 4.14, and the analogue of 


Lemma 4.10 for rectangles, we have 


Theorem 4.15 The above algorithm is O( 


Tiny competitive for the localization problem in an 


environment of n rectangles. 


4.4 Placing Unique Landmarks 


Until now, we have been considering the “drop-off” version of the problem [TPBHSS], in which 
the robot is placed in an environment with very little starting information. But another sit- 
uation in which localization arises is that of a robot which must repeatedly perform tasks in 
the same environment, and must begin by determining its current location. In such situations, 
it is useful to place k unique landmarks in the environment, so that the robot immediately 
knows where it is upon encountering one of them. It is not difficult to make these notions 
precise in our model. Let us simply say that a k-marking of the environment € is a function p 
from the vertices of the environment to the set {0,1,...,&}; exactly one vertex gets each value 
j=l,...,k, and the rest get the value 0. Each time the robot gets to a new vertex v, it can 
determine the value pu(v). 

For the sake of concreteness, let us consider the geometric trees of Section 4.2. The goal here 
is, for fixed k, to give a k-marking pz and an accompanying localization algorithm which achieves 
the lowest possible competitive ratio. Trying out this notion on the examples of Section 4.1, 


one finds that different environments can have strikingly different behaviors with respect to this 
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measure. For example, one must place at least \/n — o(,/n) landmarks in the tree of Figure 4- 
2 before bringing about an asymptotic improvement in the best competitive ratio attainable 
(since there will have to be a landmark in all but a o(1) fraction of the /n “blocks”). On the 
other hand, by placing a single landmark at the exact center of the long path II in Figure 4-3, 
one brings the best attainable competitive down from \/n to 9 (the robot simply uses two-way 
spiral search until it hits this landmark or one end of II). 

In light of this, we believe that the algorithmic question of finding the optimal k-marking, 
given a tree T or a set of rectangles R, is very interesting; we leave it as an open problem. In the 
remainder of this section, we turn to statements that can be made in general for environments 


of trees and rectangles. 


Proposition 4.16 For each tree T, there is a k-marking and a localization algorithm which is 


O(%)-competitive. 


Proof. This is not difficult to prove directly; and using a lemma from [LoS], we can actually 
prove the stronger statement that there is, in effect, a single marking which works for all k. 
Specifically, it is proved in [LoS] that there is a numbering w of the n vertices of T (i.e. a 
bijection from V to {1,...,n}) so that for each k, the removal of the vertices numbered 1 


through & results in a forest in which no component has more than 22 vertices. 


Given , we define the k-marking py, in the natural way: 


viv) ifl<po)<k 


0 otherwise 


[in(v) = 


Given this marking, the localization algorithm is rather unsubtle: the robot performs spiral 
search until p(7’) decreases to 1 or it reaches a landmark (at which point p(T’) immediately 
equals 1). Since it is traveling in a component with at most vertices, 6(T’) will never exceed 
and the result follows. li 

Unfortunately, we see that no stronger statement can be made at this level of generality 
for trees, since the trade-off in Proposition 4.16 is tight up to constant factors for the tree of 
Figure 4-2 when k = /n. For the case of rectangular obstacles, we can prove a similar trade-off 


non-constructively. 
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Proposition 4.17 For each environment R of n rectangles, there is a k-marking and a local- 


ization algorithm which is O(2%8*)-competitive. 


Proof. Note that the statement is trivially true if & is not at least Q(log n), so we will assume 
that it is in what follows. Also, we only consider markings in which landmarks are placed at all 
four corners of k/4 rectangles. Thus, by abuse of notation, we will also speak of y(R;), where 
R; is a rectangle. The localization algorithm will be the same as in Proposition 4.16: perform 
spiral search as in Step 1 of Section 4.3 until p(k’) decreases to 1 or a landmark is reached. To 
prove the stated bound, we must show that for some marking p, the robot will travel no more 
than O( 288") times the length of the optimal solution, regardless of its starting position. 

In fact, we claim that if ys is constructed by randomly marking &/4 rectangles, it will have 
this property with high probability. To prove this, we define a numbering ~, of the rectangles 
for each of the 4n vertices in the environment: 4, will tell the order in which the rectangles 
are encountered when the spiral search algorithm is performed beginning at v (i.e. Y,(R;) -— 1 


rectangles are encountered before rectangle R;, starting from v). Say that has Property W if 


8ninn 


k 


Vv JR; : w(R;) > 0 and w,(R;) < 


The probability that y fails to have Property V for a single vertex v is at most 


and so the probability that y fails to have Property WV for any vertex is at most 4, 


So consider a marking 4 which does have Property WV. Regardless of the robot’s starting 


8nlinn, 
Rk) 


location, it will encounter a landmark by the time b( R’) reaches the result now follows 


from Lemma 4.12. I 


Appendix: Proofs of Claims 4.5 and 4.6 


Recall that a subset U = {w,,...,u,} of the vertices of T induces a periodic path if T(U) 
is a simple path and there is some polygonal path P such that T(w;,u:41) = P™ for natural 


numbers m,,...,™,-1; and U induces a comb tree if T(U) consists of a periodic path on vertices 


63 


{v,...,v,} distinct from U together with disjoint paths T(u;,v;), all of which are mutually 
congruent. 

Proof of Claim 4.5. Let $ denote T(r, x,y,z), P denote the polygonal path T(r, 2) ~ T(y, 2), 
and @ denote T(r,y) ~ T(a#,z). Let us suppose by induction that the claim holds for all 
examples in which the total number of edges in P and @ together is smaller than in S (note 
that the result clearly holds when P and Q each consists of a single edge). We consider three 
separate cases, based on the number of leaves in S. 

Case 1: S$ has two leaves; suppose that these are r and z (other cases are similar). Then 
since C’ has no branch-vertices in this case, the polygonal paths PQ and QP are congruent. It 
is straightforward to show that this implies PQ is a periodic path. 

Case 2: S$ has three leaves. This is actually impossible; suppose that x, y, and z are all 
leaves (other cases are symmetric). Then since x is a leaf, P~' and @ have the same initial 
direction; since z is a leaf, P~! and Q7' have the same initial direction; and since y is a leaf, P 
and Q7-! have the same initial direction. Thus P and Q have the same initial direction, which 
implies that r must be a leaf. 

Case 3: 5S has four leaves. Then P, Q, P~', and Q7! all have the same initial direction. 
We eliminate the shortest of these four initial edges, and shorten the other three initial edges 
by the same amount. In this way, we have an example $’ with one fewer edge in P or Q, so 


the claim holds for $’. This in turn implies that S$ is a comb tree. m 


Proof of Claim 4.6. [If all triples form periodic paths, then clearly T(U) is a periodic path. 
So suppose T(x, y,z) is a comb tree. Then since 7'(2,y) has the same direction at both ends, 
T(z, y, 2’) must be a comb tree for all z’ € U. Now if T(, p,q) is a periodic path, then T(2, p) 
cannot have the same initial direction at both ends; thus T(2,p,q’) is a periodic path for all 
gq € U. From this it follows that if any triple constitutes a comb tree, then all triples do. Also, 
we can conclude that all vertices are leaves in T(U). 

Next, observe that for all z, 2’ €¢ U, has the same support point in T'(a, y, z) and T(a, y, 2’) 
(it is the maximal sequence of line segments which is the same at the beginning and end of 
T(z,y)). Thus, it has the same support point in all comb trees. We now conclude that all the 
vertices in U have disjoint congruent paths joining them to the rest of T(U). Deleting these 


paths, we obtain |U| support points joined by periodic paths. Thus 7(U) is a comb tree. m 
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Chapter 5 


Real-Time Server Algorithms 


The trade-off between the computational resources of an on-line algorithm and the competitive 
ratio it can achieve is a question that has received relatively little attention. In some sense, it 
is not hard to see why this should be the case: proving lower bounds for traditional algorithms 
is notoriously difficult, and introducing the on-line/off-line distinction only seems to add to 
the complications. Thus establishing a non-trivial relationship between two measures such as 
running time and competitive ratio appears to be mainly beyond the reach of current techniques. 

Nevertheless, it seems worth considering the problem at least for the simplest kind of on-line 
algorithms. What we have in mind is the following. Consider an on-line problem that can be 
expressed in the framework set up in Chapter 2: an algorithm A is presented with a sequence of 
requests o = {o,,0,...}in order; when o; is presented, A must perform some computation. We 
are interested in considering algorithms A that can meet some fixed time bound in processing 
each request; in particular, A should not slow down as more and more requests are presented. 
Thus, we (informally) define a real-time algorithm to be one which uses a constant amount of 
space, and constant time per request o;. Of course, the notion “constant” will depend on the 
particular problem, but the running time of A should in any case not depend on the number 
of requests seen so far. 

The 2-server problem appears to be a good one on which to try out these notions. First 
of all, a fair amount is known about the bounds one can achieve — there are at least three 
optimal 2-server algorithms in the literature [MMS, CL1, CL4] — and a number of constant- 
time algorithms have been analyzed [IR, CL2, CKPV]. Also, the structure of the 2-server 
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problem is in some sense quite a bit simpler than the structure of the general k-server problem. 

In this setting, a real-time algorithm is one which decides which server to send to each 
request using an amount of time and space independent of the cardinality of the metric space 
and of the number of points requested so far. None of the known optimal 2-server algorithms 
have this property; for example, in an infinite metric space such as R?, they slow down with 
each additional request. Indeed, currently the best real-time algorithm known has a competitive 
ratio of 4 [CL2] (improving on a 10-competitive algorithm [IR]); and until now, the only known 
metric spaces in which a fast algorithm could achieve the optimal ratio of 2 were those which 
could be embedded in a tree. 

All of this leads to a very interesting open question: is there a real-time 2-server algorithm 
with a competitive ratio of 2? We do not settle this question here; however, we give two 
results which provide evidence in opposite directions. Note also that the question is essentially 
independent of the k-server conjecture, which asks how well an on-line algorithm can perform 
regardless of the amount of computation it requires. 

First we present a real-time, memoryless 2-competitive algorithm for two servers in R4, for 
any dimension d, under the L, (“Manhattan”) metric. This result considerably extends the class 
of metric spaces for which there is known to be a fast, optimal algorithm. Moreover, the original 
real-time 2-competitive algorithm for trees was based on a very elegant memoryless strategy, 
“Double-Coverage,” that was first discovered for servers on a line [CKPV] (this algorithm 
was discussed in Chapter 2). However, attempts to extend this technique to produce optimal 
deterministic algorithms in spaces more general than trees had so far not been successful. Thus, 
an interesting feature of our main result is that it follows the style of Double-Coverage, and 
can therefore be viewed as an extension of that algorithm to the case of two servers in higher 
dimensions. 

In Section 5.2, we turn to the question of lower bounds. A number of papers have proposed 
“balancing” algorithms for the k-server problem. The basic balancing algorithm works as 
follows: for each server s;, its total distance traveled is maintained in the variable D;; when 
a request is made at a point r, the algorithm sends the server which minimizes D; + rs; (for 
z,y € M, let xy denote the distance between them). This rule was shown to be k-competitive 


for k servers when the cardinality of the request space M is k+1 [MMS], and for the “weighted- 
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cache” problem, which includes the paging problem as a special case [CKPV]. 

However, the algorithm does not achieve any constant competitive ratio, even for two servers, 
in a general metric space M. Thus it was somewhat surprising that a rule minimizing the 
quantity D; + 2rs; was shown to be 10-competitive for two servers [IR]. A later construction 
showed that this algorithm is no better than 6-competitive [CL2]. 

Here, we show a new lower bound for the class of balancing algorithms in general. Let 
f : Rt —R, and B; be the server algorithm which does nothing when the request point is 
already covered, and otherwise moves the server which minimizes D; + f(rs;), where D, is the 
total distance traveled by server 7. We will describe B; as a balancing algorithm with cost 
function f. Observe that we make no restrictions whatsoever on the nature of the function f. 

Our main result is a lower bound of (5 + V7)/2 (~ 3.82) on the competitive ratio of any 
such balancing algorithm for two servers. In view of the 2-competitive algorithms of [MMS, 
CL1, CL4], this shows that no optimal on-line 2-server algorithm can be expressed as a decision 
rule B; for any f. To our knowledge, this represents one of the first lower bounds for on-line 
algorithms based solely on computational resources. 

We use the following notation. For a server algorithm A, let , (A) denote its competitive 
ratio. If A is ccompetitive for some c, we will say it is competitive, otherwise, we write 
,(A) = ow. As before, OPT will denote the optimal off-line server algorithm, whose cost 
on a request sequence o is always the minimum possible; we sometimes refer to OPT as the 


“adversary” algorithm. 


5.1 Two Servers in Euclidean Space 


Let x‘ denote the 7 coordinate of c € R*. For two points z,y € R%, define 7,(2,y) to be 
the closed d-dimensional box (possibly degenerate) with « and y as opposite corners. That is, 
w € R“ lies in ra(z, y) if and only if w' is between min(z', y') and max(a',y') for? =1,...,d. A 
point z € R4 induces the following partial order <, on R4: x <, y if and only if x is contained 
in the box Ta(y, z). It is easily verified that <, satisfies the properties of a partial order. 

First consider the following problem. We have two sets P and Q, each consisting of n points 
in (R“, £,), and we wish to find a minimum-cost perfect matching between them. The following 


lemma is central to the analysis of the algorithm. 
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Lemma 5.1 For each q € Q, there is a minimum-cost matching M in which q¢ is matched to 


one of the points of P which is minimal with respect to <,. 


Proof. Consider a matching M‘ in which p’ € P is matched to q, and p’ is not minimal with 
respect to <,. In this case, there is some minimal p € P such that p <, p’; suppose that p is 
matched to some gq’ € @. We can draw a shortest path from p’ to q that passes through p; thus 
the matching M in which p is matched to q and p’ is matched to q’ has cost at most that of 
M'.t 


The Algorithm in the Plane 


We first give the algorithm in the plane, and then show how to extend it to higher dimensions. 
We number the quadrants of the plane in clockwise order, beginning with the positive quadrant. 
As is standard in analyzing server algorithms, we view the request sequence as a game between 
the algorithm and an omniscient adversary. In each round of the game, the adversary generates 
a request and serves it optimally (using knowledge of the future requests). The algorithm then 
must also serve the request, without knowing anything about the behavior of the adversary 
servers or the remainder of the request sequence. 

Assume that the algorithm’s servers are at points s; and s., the adversary servers are at 
points 0, and o., and a request comes to some point in the plane. Since the £, metric is 
invariant under translations and 90-degree rotations, we can assume that the request is at the 
origin and that the on-line server closer to the request, s,, lies in the first quadrant. 

First, the algorithm computes the partial order <, with respect to the origin, on the two 
servers. Clearly s, will be a minimal element. If sy is not also minimal, we simply move s, 
along a shortest path to the origin. Otherwise, both are minimal and we have the following 


cases: 


1. 8» lies in the third quadrant. Here we move both servers at constant speed along shortest 


paths to the origin until s; reaches it. 


2. S lies in the second quadrant. Here, we move both servers at constant speed to the 
positive y-axis — if s) reaches it first, we also move it down the y-axis — until s. is no 


longer minimal. 
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3. 5 lies in the fourth quadrant. This is strictly analogous to the previous case. 


4. s» also lies in the first quadrant. Then consider the rectangle 7(s;, 52), and let y be the 
corner of this rectangle closest to origin. We move both servers toward y until s; reaches 


it; now Ss, is no longer minimal. 
Theorem 5.2 The above algorithm is 2-competitive in the L, plane. 


Proof. We analyze the algorithm’s performance using the CDRS potential function [CDRS] 


© = 8189 + 2Mmin(S,OPT), 


where Mnin(S,OPT) denotes the value of the minimum-cost matching between the adversary 
servers and the on-line servers, and 5,52 is, as before, the distance between s, and sy. By the 
standard potential-function argument, the following two facts will imply that the algorithm is 


2-competitive. 
1. When the adversary moves, paying c, it can raise ® by at most 2c. 
2. When the algorithm moves, paying c’, it lowers ® by at least c’. 


The first of these facts holds in any metric space, and has been proved in earlier papers [CKPV, 
CL3, CDRS]; the argument is simply that the value of the minimum-cost matching goes up by 
at most c’ while the first term of ® is untouched. 

Now consider the second fact. We break the behavior of the algorithm into two phases. 
In the first (possibly empty) phase, both servers are moving; in the second phase, only one 
server is moving. In the second phase, in which one server moves a distance d, the value of first 
term of © increases by at most d, while Lemma 5.1 implies that the value of the minimum-cost 
matching goes down by d. Thus ® goes down by at least d, as desired. 

When the algorithm is moving both servers (each a distance d’) the matching component 
cannot go up (whichever server is matched to the origin is decreasing its contribution to the 
cost optimally). Meanwhile, examining the four cases above, we see that the first term of ® 


goes down by 2d’. Thus, ® decreases by at least 2d’, and the proof is complete. fi 
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Under the Euclidean metric, the on-line servers can follow the above algorithm and pay at 
most the £, cost. Meanwhile, the adversary only has to pay the L. cost, which can be smaller 


by at most a factor of 2. Thus we have 


Corollary 5.3 The algorithm is 2\/2-competitive (~ 2.83) in the Euclidean plane. 


The Algorithm in d Dimensions 


Consider now the case of two servers in R%, under the L, metric. As before, by a translation 
of R4, we may assume that the request is at the origin; we also relabel the servers if necessary 
so that s, is closer to the request. 

The algorithm in d dimensions works as follows. If only one server is minimal, it moves 
on a shortest path to the origin while the other remains fixed. Otherwise, both servers are 
minimal. Let s; = (s{,...,s/) and s2 = (s3,...,s84) in coordinates; we proceed in the following 


two stages: 


1. As long there is a j for which s} and s) have opposite sign, we move the servers at the 


same speed towards the hyperplane {z : 27 = 0} so as to bring one of these values to 0. 


2. The servers begin the second stage with s; and s, having the same sign for each j; that 
is, they lie in the same closed orthant. By relabeling the axes, we may assume this is the 


positive orthant. Let y be the corner of Tg(s1, 5) closest to the origin; that is, 


y = (min(s!,s1),...,min(sf, sf), 


As before, denote the j'® coordinate of y by y’. The servers repeatedly execute the 
following step: each picks a j for which s? — y is strictly positive, and moves at constant 
speed towards the hyperplane {x : 2’ = y/} so as to decrease s! to y’. Each such move 
reduces the number of coordinates 7 for which s! —y! > 0 for one of the servers s;. Thus, 
after at most d such steps, server s, will be sitting on y. At this point, only s; is minimal, 


and we proceed as above. 


Theorem 5.4 The algorithm is 2-competitive under the L, metric in R‘. 
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Proof. We use the same potential function ®. The analysis of the adversary’s move and the 
case in which only one on-line server is minimal are handled as in the proof for two dimensions. 
When both on-line servers move a distance d, we must show that the potential ® drops by at 


least 2d. This will be implied by the following property: 


(*) Both servers are moved a distance d closer to the origin and 2d closer to each other 


simultaneously. 


The motion of the servers in the first stage of the algorithm clearly satisfies this property. 
Observe that throughout the second stage, there is a shortest path from s; to s. that passes 
through the corner y of Tq(51, 52). But each step in this stage brings a server simultaneously 


closer to the origin and to y; thus Property (*) is maintained in the second stage as well. i 


Corollary 5.5 The algorithm is 2/d-competitive under the Euclidean metric in R4. 


5.2 A Lower Bound for Balancing Algorithms 


To prove the lower bound, we begin by “classifying” the function f used in the balancing rule in 


a certain way. We will say that a property P of positive real numbers holds “e.f.” (everywhere 


but in a finite interval) if dapV(@ > v)P(#). Similarly P holds “a.l.” (for arbitrarily large 


reals) if Vaod(a > 2)P(«). For any function f : Rt — R, and p € Rt, say that f € y(p) 
if f(z) < pa ef. If for some such p, f € y(p), we will write py = inf{p: f € y(p)}; pp = ~ 
otherwise. Observe that f does not necessarily belong to y(p;), since the infimum is not 
necessarily attained. 

Finally, the behavior of B; is ambiguous when D; + f(rs;) = D; + f(rs;) for i #7, and i,j 
both minimize this expression. The standard convention here is to let the adversary break the 
tie; in any event, none of our constructions rely on such degeneracies. 


For the remainder of this section, all server algorithms will be 2-server algorithms. 
Lemma 5.6 If B; is competitive, then f(x) > 0 e.f. 


Proof. Assume by way of contradiction that f(z) < 0 al. and that B; is c-competitive for 
some c. Choose zo so that f(a) < 0 and No large enough so that No > cao and No > —f(2o). 
Since f(z) <0 a.L, we can find an N > No such that f(.N) < 0. 
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Figure 5-1: Metric Space M, 


Consider the metric space M, in Figure 5-1, with WX =YZ=%,WY =WZ= XY 


XZ=N. Server s; begins on W and server sy on X. The first request is to the point Y. Since 
both servers are the same distance from Y, we may assume that s,; responds. Now we request 
point 7; since 


server s) will respond. At this point, By has paid 2N > 2cao, while OPT’, by using one server 
to cover requests to W, X and the other to cover Y, Z, need pay only 2). Moreover, the situation 
is now symmetric to the beginning, so we can repeat the process indefinitely, contradicting the 
claim the By; is c-competitive. Ii 

Armed with this lemma, we can prove a much stronger result, generalizing the observation 


that the basic balancing algorithm is not competitive. 
Lemma 5.7 If B; is competitive, then p; > 1. 


Proof. Assume by way of contradiction that p; < 1 and B; is c-competitive. We have two cases 


to consider: f € y(1) and f ¢ y(1). 
If f € y(1), then for some x > 0,0 < f(x) < a for all > a. Consider metric space M,, 
with WX = YZ = ao, all other lengths equal to N = cx, and the servers initially on W,X. 


Suppose we request point Y and s, responds. If we now request point 7, then since 


server S» will respond. Moreover, this process can be continued indefinitely. Thus By pays 2ex 
in each round of this sequence, while OPT need pay only xo, as in the above arguments. 


If f g y(1), then we can still find some y > 0 such that 0 < f(x) < (1+ 4)a for all « > y. 
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Moreover, since f ¢ y(1), we can find some 2» > y such that f(a) > %/2. Construct metric 
space M, with WX = YZ = xo, all other lengths equal to cap, put s, on W, s. on X, and 


request point Y. Suppose s; responds; we now request point Z. Since f(a) > xo/2, 
1 1 
eto + flo) > (e+ 5)to = (1+ Zero > Flexo), 


the last inequality following since ex) > y. Thus sy will respond. In this way, By pays 2ex%) on 


this sequence, while OPT need only pay x. Bi 


o 2 


Figure 5-2: Metric Space Ms 


Lemma 5.8 [f p; = p> 1, then, (By) > p/2+ 3/2. 


Proof. We show that for each 8 > 0, there exist arbitrarily long request sequences o for which 
B;(o) > (p/2 + 3/2 — B)OPT(c); this will establish the lemma. We construct a countably 
infinite metric space Mz as follows. The “core” of the space is a short segment XY of length do, 
chosen so that pdy/2 < f(do) < 2pdo. The sequence oa is built in phases; in phase j, 7 = 1,2,..., 
all the requests are to X, Y, and a point Z; at distance d; from both X and Y. (See Figure 5-2.) 


Let a; denote the sequence up through the end of phase 7. 
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Let D; denote an d;, and Dj denote the sum of D; and the total distance traveled by 
both on-line servers in all previous phases. We fix some very small a > 0, to be determined 
below, and since py = p, we can choose d; such that Dj < adj, and f(d;) > (p— 3/2)d;. Phase 
j begins with the two on-line servers sitting on X and Y. First the point Z; is requested; then 
the points X and Y are requested repeatedly until the on-line server at Z; returns to the core 
XY; at this point the phase comes to an end. 

In this phase, one adversary server moves out to Z; and immediately returns on the next 
request, for a cost of 2d;. Meanwhile, the on-line server covering X and Y has built up a 


distance of no more than D‘ from all previous phases, so it must move more than 


d; + f(d;) — (Dy + f(do)) > dj + (p— B/2)d; — (ad; + 2pad;) 


= (pt 1d; — (2p + ad; — (8/2)d; 


before the server at Z; is selected to return. Thus, the on-line servers move a distance of at 


least 


(p + 3)dj — (2p + Lad; — (8/2)d; 


in this phase, while the adversary servers move no more than 2d; + 2ad; in all phases up to 


this one. By choosing a small enough, we can ensure that 


(p + 3) — 2p t+ Ia — (8/2) 


Ale) 74a > p/2+3/2- 8. 


Since we can continue this construction for an arbitrary number of phases, the result follows. 


It is clear that a very similar construction involving metric space M, shows that if py = ~, 


then B; is not competitive. The final lemma is based on a construction in [CL2]. 
Lemma 5.9 [f p; = p> 1, then, (B;) > 3p/(p—1). 


Proof. In the style of the previous proof, we show that for each @ > 0, 


(By) > Pe — 8. 


Figure 5-3: Metric Space Ms 


Choose positive 6 < 3/2 and 


2(p— 1)? 
< 2eo Uy 


4p — 1 
Now take N large enough so that f(z) < (p+e)x for all a > N and, since p; > p—€, we can 
find a > N such that f(a) > (p—e)a. 
Consider the metric space M3 of Figure 5-3. Triangles UVW and XY Z have sides of length 


a; the three edges crossing between the triangles have length 


Put server s; on U and sy on V. We place the adversary servers on V and Z. Also, set 
D, = a/2 and Dz = 0 (this can be easily accomplished by starting one of the on-line servers 
from a seventh point in the metric space). 

We construct a request sequence as follows. First we request Z, which will be served by 59. 
We then request X — as shown below, this will be served by s,. At this point, Dz exceeds D, 
by a/2, and the situation is symmetric to the beginning. The optimal algorithm can use one 


server to cover each triangle and hence pays a; B; pays 26 + a, for a ratio of 


2b on +1 
fel i oe 26> 


a p— p— 


3p 
ree tae 


Thus we need only verify that s, will serve the request at X. This follows because 


a 2Qp+1 2p* + 2p—1 2Qp+1 ) 
bree eee Be ae eee ee ee, 
i + f(6) s+ ((FS Ja) <a page Ngee 
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which is less than or equal to 


2p? + 2p-1 ) 
———— —-é-«]. 


Dat fla) = a4 b+ fla) > «(PE 


i 
Theorem 5.10 For all f, , (B;) > (5+ V7)/2. 


Proof. Tf p, < 1 or py = o©, then By is not competitive. In the case where 1 < py < ~™, 


Lemmas 5.8 and 5.9 imply that 


, (By) > max{ 2h Prt 3) 
pp-l 2 


This expression is minimized at p; = 2+ V7, with a value of (5+ /7)/2. 


As noted above, we have the following interesting corollary. 


Corollary 5.11 No optimal on-line 2-server algorithm can be expressed as B; for any function 


f. 
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Chapter 6 


Conclusion and Open Problems 


6.1 Robot Navigation 


We have considered two basic on-line navigation problems: searching for a goal in an unknown 
polygon, and determining one’s location in an environment for which the map is known (local- 
ization). A number of questions are left open by this work; we summarize some of these below, 


along with a number of other possible directions for future work. 


1. To begin with, there are gaps between the upper and lower bounds that we can show for 
the localization problem in both kinds of environments considered in Chapter 4. Certainly 
for the case of rectangular obstacles, it seems that there should be room for improvement 
on the upper bound. Indeed, it is quite possible that essentially the same algorithm given 
here actually achieves a much better competitive ratio; it appears that a stronger analysis 


of the geometry of the rectangle packing is needed. 


Also, as suggested in Section 4.4, the problem of placing landmarks brings up some 
interesting algorithmic questions. Here is one concrete decision problem: given a tree 
T and an integer k, does there exist a k-marking p and an accompanying localization 
algorithm with competitive ratio at most 9? (9 appears to be the right constant to use 


here, as it is the best competitive ratio attainable for the search problem on a line.) 


2. The results of Chapter 3 apply only to the case of simple rectilinear polygons; i.e. those 


with no holes. [DKP] conjectures that a constant competitive ratio should be attainable 
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for the problem of exploring a rectilinear polygon containing an arbitrary number of 
rectilinear holes. (They give an algorithm which is O(k)-competitive, where k is the 


number of holes.) 


Since a number of variations on this problem are possible, let us make precise what we 
mean: we want the robot (whether on-line or off-line) to traverse a path from which it can 
see all points in free space and on the boundaries of the obstacles. This question appears 
to be rather difficult (and the off-line version is NP-Complete, by a simple reduction from 
the Traveling Salesman problem); perhaps a good starting point would be the special case 


of a large square containing an arbitrary number of unit squares as obstacles. 


Of course, one could also consider the search problem in this setting; it is not clear in this 


case what kind of bounds one would be trying to achieve. 


3. The problem of using multiple interacting robots effectively is one that comes up fre- 
quently in the larger robotics literature. Most of the previous theoretical work on multiple 
robots is somewhat orthogonal to these concerns; it has tended to deal with computation- 
ally limited robots that use one another mainly for “pebbling” purposes (see e.g. [BK] 


and the references therein). 


We are interested in the following sort of problem. One has & independent robots trying 
to solve a search problem in some geometric environment; the quantity being minimized 
in this situation is the time taken to find the goal, rather than the total distance traveled.! 
If the best competitive ratio attainable by a single robot in this environment is c, can one 
give algorithms which are O(775)-competitive, where f(k) is a reasonably fast-growing 
function? 

This question resembles some of the load-balancing problems that have been considered 


from the perspective of competitive analysis; essentially we are asking how “parallelizable” 


the search problem is in the on-line setting. 


4. Much of the appeal of Klein’s street problem (Chapter 3) is in the definition of a street 


itself: it is a natural type of polygon in which a constant competitive ratio can be achieved 


'For a single robot, time and distance tend of course to be interchangeable in most of these problems. 
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for the search problem. We are interested in looking for other types of polygons in which 


competitive search is possible. 


One obvious possibility is the class of star-shaped polygons (P is star-shaped if there is 
some point inside P that can see the entire boundary): can a robot starting at s find 
a point ¢t while traveling a constant factor times d(s,t)? A related and probably easier 
question is that of whether a constant competitive ratio can be achieved for the problem 


of finding a member of the kernel of P (i.e. the set of “star points” of P). 


Of course, star-shaped polygons are a rather simple class, but studying them may suggest 
more general algorithms. For example, suppose we define the idlumination number of 
P to be the minimum number of point-source “light bulbs” needed to provide direct 
illumination to the entire boundary of P from its interior (so P is star-shaped iff its 
illumination number is one); now we can look for algorithms that achieve a constant ratio 


for searching polygons with fixed illumination number. 


. What does it take to build a maze? Specifically, it is interesting to ask oneself what 
constitutes the difference between the “shortest-path” problems of [PY, BRaSc, BBFY], 
and the geometric search problems [K, K]] of the type discussed in Chapter 3. Initially, 
one is tempted to conclude that the difference is in whether the coordinates of the goal 
are known; but, as any good labyrinth-designer knows, it is possible for the entrance and 
exit of a maze to be physically right next to each other, and for the maze still to be very 


difficult. 


In other words, when the obstacles can be sufficiently complex, there is no value in knowing 
the coordinates of the goal. We can make this more precise as follows. Let F denote a 
family of rectilinear polygons, which will be used as obstacles in the sense of [PY] (e.g. F 
could be the set of all rectangles, or the set of all L-shaped polygons, and so on). As 
usual, we will require that all members of F have at least unit thickness, and that they 
cannot be “glued together.” Consider now the navigation problem in which the plane is 
filled with members of F, and a robot starting at s is given the coordinates of a point t 
which it must travel to. Let n denote the straight-line distance from s to ¢; recall that 


this could be much less than d(s,t), the length of the shortest obstacle-avoiding path. 
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Say that F is maze-inducing if for no function f is there an f(n)-competitive on-line 
algorithm; that is, the competitive ratio for the navigation problem cannot be bounded 
in terms of the straight-line distance from s to t. Thus, the results of [BRaSc] show that 
the class of rectangles is not maze-inducing. On the other hand, it is not difficult to build 
mazes when F is the set of all rectilinear polygons (with at least unit thickness). So 
a natural question would be to try giving some characterization of classes F which are 
maze-inducing. For example, is it true that no F consisting of polygons with a uniformly 
bounded number of vertices, or even a uniformly bounded illumination number, can be 
maze-inducing? Along these lines one could define, by analogy with the illumination 
number, the segment number of P to be the minimum number of line-segment light sources 
needed to provide direct illumination to the entire boundary of P from its interior. We 
can show it is possible to build mazes when F is the set of polygons with segment number 
at most three, but do not know whether it is possible when the segment number is at 


most two. 


Of course, there are many other issues one could consider. As mentioned at the beginning 
of Chapter 4, most on-line navigation problems studied up to this point have maintained a 
fairly binary distinction between having a map and having no map. But many naturally arising 
robotics problems involve navigation when the robot has some limited information about its 
relation to the environment. The localization problem is a step in this direction, as are the 
papers of Donald [Don] and Blum and Chalasani [BC]. 

As in the case of several other on-line problems, there are numerous “design choices” one 
faces when formulating a navigation problem of the variety studied here. We believe it is 
important in this process to keep in mind the problems faced by designers of autonomous 
mobile robots. It is not necessary that the theoretical algorithms produced in this setting be 
directly implementable in contemporary robots; but they seem to be most valuable when they 


provide some insight into the structure of navigation problems faced in real-world applications. 
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6.2 Servers 


We have addressed the notion of real-time 2-server algorithms — those that only use only 
a constant amount of space, and constant time per request. It is well-known that the best 
competitive ratio attainable by a 2-server algorithm is 2, and this can be achieved by several 
algorithms, all of which perform a substantial amount of computation on each request. We have 
shown a 2-competitive real-time algorithm in nm dimensions, for every n, under the Manhattan 


metric. On the other hand, we have shown that the class of balancing algorithms (a subset of 
b+/7 
2 


the real-time algorithms) cannot achieve a competitive ratio better than in general. 

There are some immediate questions one could consider in light of these results, as well as 
a more global one. Specifically, can our algorithm for the Manhattan metric be adapted to the 
standard Euclidean metric while maintaining the competitive ratio of 2? Can it be extended 
to the case of three or more servers? (We can show that most natural generalizations in this 
direction are not 3-competitive.) It would also be interesting to try improving the upper bounds 
for 2-server balancing algorithms; the lower bound we show in Chapter 5 is still below the best 
known competitive ratio for a general real-time algorithm. 

But at a higher level, we believe the fundamental problem here is to decide whether or 
not there is a real-time 2-server algorithm with a competitive ratio of 2. We do not venture a 
conjecture; the results of the previous chapter present evidence for both possible answers. Of 
course, one could ask the analogous question for more than two servers, where all these problems 
become much murkier ... but the main point is this: lower bounds for on-line algorithms have 
typically been based simply on information-theoretic arguments alone; that is, regardless of how 
the algorithm makes its decisions, there will be input sequences on which it performs badly. The 
hope is that as we begin to understand this area better, we can also begin making distinctions 


based on the computational resources an algorithm has at its disposal. 
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